Application No.: 10/607,809 7 Docket No.: 300622004810 

REMARKS 

Claims 1 through 19 are pending in the application, of which claims 3 through 6 have 
been withdrawn from consideration. Claim 2, and corresponding dependent claims 10 through 19, 
have been rejected under 35 U.S.C. §101. Claim 1, and corresponding dependent claims 2 and 7 
through 19, have been rejected under 35 U.S.C. §112. 

Amendments to the specification 

The paragraph beginning on the last two lines of page 34 and continuing on page 35 has 
been amended to remove a reference to a website. 

Amendments to the claims 

Claim 1 has been amended to clarify the invention. Support can be found in the 
specification, inter alia, at page 8, lines 13-18, and in the claims as originally filed. Claim 2 has 
been amended to clarify the invention. Support can be found, inter alia, in the claims as originally 
filed. 

Claims rejected under 35 U.S.C. §101 

The Examiner has stated that "[c]laims 2 and 10-19 are drawn to a host cell, which reads 
on a product of nature." The Applicant has previously argued that claim 2 does not describe a 
product of nature, because the host cell comprises recombinant genes, which are not products found 
in nature. As defined in the specification (p. 6, lines 2-3), ". . .the term recombinant refers to a cell, 
compound, or composition produced at least in part by human intervention." 

Nonetheless, the Examiner remains concerned that claim 2 reads on a product of nature. 
Though Applicant does not necessarily agree with the Examiner, Applicant has amended claim 2 to 
address her concerns by inserting the phrase "an isolated" in front of "host cell". Accordingly, 
Applicant respectfully requests that the Examiner withdraw the 35 U.S.C. §101 rejection of 
amended claim 2, and the corresponding dependent claims 10 through 19. 



sd-357877 






Claims rejected under 35 U.S.C. §112 

Claim 1, and corresponding dependent claims 2 and 7 through 19, have been rejected 
under 35 U.S.C. §112 as "containing subject matter which was not described in the specification in 
such a way as to reasonably convey to one skilled in the relevant art that the inventor(s), at the time 
the application was filed, had possession of the claimed invention." 

In response, Applicant previously clarified that the subject matter at issue consisted of 
structures that were known at the time of the invention. Therefore, where the prior art teaches what 
the applicant refers to, no sequence or structural description is needed in the specification, let alone 
in the claims. Neither recitation of the sequences nor incorporation by reference is necessary where 
what the application refers to is known. 

To further address the Examiner's concerns, Applicant offers the following references to 
illustrate that the biological sequences used in the claimed invention were known individually in the 
art at the time of the invention: 



Claim Element      GenBank Reference 
KSQ domain         GenBank AF016585 (12/7/1997) 
AT domain          GenBank AF016585 (12/7/1997) 
ACP domain         GenBank X56107 (10/14/1991) 
ccr                GenBank accession no. U67612 (3/31/1998) 
icm                GenBank accession no. U37135 (3/27/1996) 



Additionally, scientific literature references include: 
KSQ domain 

Kakavas, S. J., et al., "Identification and Characterization of the Niddamycin 
Polyketide Synthase Genes from Streptomyces caelestis," Journal of 
Bacteriology, Dec. 1997, pp. 7515-7522. (Exhibit A) 






AT domain 

Stassi, D. L., et al., "Ethyl-substituted erythromycin derivatives produced by 
directed metabolic engineering," Proc. Natl. Acad. Sci. USA, Vol. 95, pp. 
7305-7309, June 1998. (Exhibit B) 



ACP domain 

Donadio, S. and Katz, L., "Organization of the enzymatic domains in the 
multifunctional polyketide synthase involved in erythromycin formation in 
Saccharopolyspora erythraea," Gene, 111 (1992) pp. 51-60. (Exhibit C) 

ccr 

Vrijbloed, J.W., et al., "Insertional Inactivation of Methylmalonyl Coenzyme 
A (CoA) Mutase and Isobutyryl-CoA Mutase Genes in Streptomyces 
cinnamonensis: Influence of Polyketide Antibiotic Biosynthesis," Journal of 
Bacteriology, Sept. 1999, pp. 5600-5605. (Exhibit D) 

icm 

Stassi, D. L., et al., "Ethyl-substituted erythromycin derivatives produced by 
directed metabolic engineering," Proc. Natl. Acad. Sci. USA, Vol. 95, pp. 
7305-7309, June 1998. (Exhibit E) 



Applicant respectfully requests that the Examiner consider the above references as 
supporting the Applicant's assertion that the individual claim elements of claims 1 and 2, and their 
corresponding dependent claims, were well known at the time that the application was filed. 
Accordingly, the Examiner is respectfully requested to withdraw the 35 U.S.C. §112 rejection. 






CONCLUSION 



In view of the above, each of the presently pending claims in this application is believed 
to be in immediate condition for allowance. Accordingly, the Examiner is respectfully requested to 
withdraw the outstanding rejection of the claims and to pass this application to issue. 

If it is determined that a telephone conference would expedite the prosecution of this 
application, the Examiner is invited to telephone the undersigned at the number given below. 



In the event the U.S. Patent and Trademark Office determines that an extension and/or 
other relief is required, applicant petitions for any required relief including extensions of time and 
authorizes the Commissioner to charge the cost of such petitions and/or other fees due in connection 
with the filing of this document to Deposit Account No. 03-1952 referencing docket no. 
300622004810. However, the Commissioner is not authorized to charge the cost of the issue fee to the 
Deposit Account. 

Dated: February 16, 2007 Respectfully submitted, 




Registration No.: 45,193 



MORRISON & FOERSTER LLP 
12531 High Bluff Drive, Suite 100 
San Diego, California 92130-2040 
(858)720-7961 






Journal of Bacteriology, Dec. 1997, p. 7515-7522 
0021-9193/97/$04.00 +0 

Copyright © 1997, American Society for Microbiology 



Vol. 179, No. 23 



Identification and Characterization of the Niddamycin 
Polyketide Synthase Genes from Streptomyces caelestis 

STEPHAN J. KAKAVAS, LEONARD KATZ, and DIANE STASSI* 
Antibacterial Discovery Division, Abbott Laboratories, Abbott Park, Illinois 60064 

Received 29 August 1997/Accepted 26 September 1997 

The genes encoding the polyketide synthase (PKS) portion of the niddamycin biosynthetic pathway were 
isolated from a library of Streptomyces caelestis NRRL-2821 chromosomal DNA. Analysis of 40 kb of DNA 
revealed the presence of five large open reading frames (ORFs) encoding the seven modular sets of enzymatic 
activities required for the synthesis of a 16-membered lactone ring. The enzymatic motifs identified within each 
module were consistent with those predicted from the structure of niddamycin. Disruption of the second ORF 
of the PKS coding region eliminated niddamycin production, demonstrating that the cloned genes are involved 
in the biosynthesis of this compound. 



Niddamycin is a macrolide antibiotic which is able to bind 
50S ribosomal subunits to inhibit protein synthesis. The compound 
was first discovered as a secondary metabolite of Streptomyces 
djakartensis (16) and was later found to be produced by 
Streptomyces caelestis NRRL-2821 (11a). The structure of nid- 
damycin (Fig. 1) suggests that the polyketide backbone of the 
macrolide ring is formed through the ordered condensation of 
carboxylic acid residues derived from acetate, propionate, bu- 
tyrate, and perhaps glycolate (24). The disaccharide, mycam- 
inose-isobutyrylmycarose, is attached to the macrolide ring at 
C-5. 

Macrolides belong to a class of molecules referred to as 
complex polyketides, which are synthesized on large, multi- 
functional enzymes called polyketide synthases (PKSs). The 
synthesis of polyketides is mechanistically similar to that of 
fatty acids; however, a greater variety of starter and extender 
carboxylic acid residues are incorporated into the growing 
polyketide chain, and the β-keto groups formed after each 
condensation step undergo various degrees of reduction (15, 
20). 

PKSs, in general, contain all of the enzymatic activities necessary 
for the sequential condensation of acyl thioesters (β-ketoacyl acyl 
carrier protein synthases [KS], acyltransferases [AT], and acyl 
carrier proteins [ACP]), the subsequent reduction of the β-keto 
groups (dehydratases [DH], enoylreductases [ER], and ketoreductases 
[KR]), and the release of the completed chain from the PKS 
(thioesterases [TE]). Analysis of the erythromycin PKS genes 
revealed that these enzymatic domains are organized into modules, 
each of which is responsible for one round of condensation and 
reduction (5, 7, 9, 10). As a result, there is a direct correlation 
between the number of modules contained within the erythromycin PKS 
and the length of the polyketide chain. In addition, the genetic 
order of the erythromycin PKS modules was found to be colinear with 
the order of biochemical reactions, allowing directed genetic 
alterations which produce predicted novel erythromycin derivatives 
(9, 11). 

The polyketide portion of the 16-membered macrolide niddamycin 
is predicted to be synthesized by a complex (type I) PKS (15) 
comprising seven modules, each catalyzing one condensation 
reaction. It had previously been suggested that the choice of the 
extender coenzyme A (CoA)-thioester is determined by the AT domain 
contained in each module (9). Sequence comparisons of AT domains 
for fatty acid and polyketide synthases have revealed specific 
sequence motifs for malonyl- and methylmalonyl-ATs (m- and mmATs) 
(13). The chemical structure of niddamycin (Fig. 1) suggests that 
the PKS should contain ATs specific not only for malonyl-CoA 
(modules 1, 2, 3, and 7) and methylmalonyl-CoA (module 4) but also 
for the more rarely encountered ethylmalonyl-CoA (module 5) and for 
an as yet undetermined CoA derivative which results in the 
insertion of hydroxymalonate into the growing chain (module 6). In 
addition, the structure predicts the enzymatic motifs which should 
be present in each module to give the corresponding reduction state 
of the β-carbonyl group formed after each condensation. 

This study describes the isolation and characterization of the 
PKS genes responsible for niddamycin biosynthesis, including 
those AT sequences which may result in the introduction of 
unusual side chains to the macrolide ring. 

* Corresponding author. Mailing address: Abbott Laboratories, 
D-47P AP-9A, 100 Abbott Park Road, Abbott Park, IL 60064. Phone: 
(847) 938-7389. Fax: (847) 938-3403. E-mail: 
stassid@ugenel.pprd.abbott.com. 

MATERIALS AND METHODS 

Media and reagents. SeaKem agarose was obtained from FMC BioProducts, 
Rockland, Maine. Bacto Soytone, soluble starch, yeast extract, Bacto Agar, and 
Antibiotic Medium 11 (AM 11) were obtained from Difco Laboratories, Detroit, 
Mich. SGGP medium (34) and AS-1 medium (4) have been described previously. 
SCM medium contains (per liter) the following: Bacto Soytone, 20 g; soluble 
starch, 15 g; morpholinepropanesulfonic acid, 103 g; yeast extract, 1.5 g; and 
CaCl2, 0.1 g. R3M plates consist of (per liter) the following: sucrose, 103 g; 
K2SO4, 0.25 g; yeast extract, 4 g; Casamino Acids, 4 g; tryptone, 4 g; agar, 22 g; 
and H2O, 830 ml. After sterilization, 20 ml of a 2.5 M solution of MgCl2, 20 ml 
of a 50% glucose solution, 20 ml of a 2.5 M solution of CaCl2, 12.5 ml of a 2 M 
solution of Tris-HCl (pH 7.0), 0.2 ml of a 5,000× concentrated trace element 
solution (14), 25 ml of a 1 M solution of NaOH, and 0.37 ml of a 0.5 M solution 
of KH2PO4 were added. 

Strains, bacteriophage, and plasmids. Escherichia coli DH5α cells and E. coli 
XL1-Blue MR cells were purchased from BRL/Life Technologies, Gaithersburg, 
Md., and Stratagene Cloning Systems, La Jolla, Calif., respectively. E. coli S17-1 
was obtained from Philippe Mazodier, Institut Pasteur, Paris, France. 
Bacteriophage T7 was obtained from Phil Youngman, University of Georgia, Athens. 
Plasmid pGEM-3Zf was purchased from Promega Corporation, Madison, Wis. 
Plasmids pUC19 (35), pKC1139 (6), and pNJ1 (31) have been described 
previously. 

Nucleic acid isolation and manipulation. Plasmid isolation from E. coli was 
performed by using a Qiagen Midi kit and Qiaprep Spin Plasmid kit (Qiagen 
GmbH, Hilden, Germany). DNA cloning and manipulations were performed by 
using standard procedures (27). Restriction enzymes, T4 DNA ligase, calf 
intestinal alkaline phosphatase (CIAP), and the Klenow fragment of DNA 
polymerase I were purchased from BRL/Life Technologies. Hybond-N membranes were 
obtained from Amersham Corporation, Arlington Heights, Ill. 






FIG. 1. Structure of niddamycin. The atoms of the macrolide ring are num- 
bered. The thick lines indicate carbon atoms contributed by the indicated mod- 
ules (Mod). 



Construction of a Streptomyces caelestis genomic library. S. caelestis DNA was 
prepared by partially digesting 5 μg of DNA with SauIIIA and size selecting 
fragments of around 40 kb by electrophoresis through a 0.4% agarose gel. 
Cosmid pNJ1 arms were prepared by digestion of the vector with EcoRI, 
dephosphorylation with CIAP, and then digestion with BglII to generate one arm 
and digestion of the cosmid with HindIII, dephosphorylation with CIAP, and 
then digestion with BglII to generate the other. Ligation of 1 μg of cosmid arms 
to 1 μg of the size-selected S. caelestis DNA was in a 20-μl volume. Two 
microliters of this ligation mix was packaged by using Gigapack II XL 
(Stratagene) as instructed by the manufacturer, and E. coli XL1-Blue MR cells were 
hosts for infection. Individual colonies were transferred to 30 96-well plates 
(Costar, Cambridge, Mass.). 
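
The library arithmetic above can be illustrated with a short sketch. It multiplies out the 30 96-well plates and then applies the standard Clarke-Carbon estimate of the chance that a given locus is represented in a random library. The article does not report a coverage calculation; the formula is a textbook estimate, and the 8,000-kb genome size is an assumption chosen only as a plausible order of magnitude for a Streptomyces genome.

```python
# Minimal sketch: library size and a Clarke-Carbon coverage estimate.
# ASSUMED_GENOME_KB is an assumption for illustration; it is not from the article.

PLATES = 30            # from the text: 30 96-well plates
WELLS_PER_PLATE = 96
INSERT_KB = 40         # from the text: fragments of around 40 kb
ASSUMED_GENOME_KB = 8000


def clones_arrayed(plates=PLATES, wells=WELLS_PER_PLATE):
    """Number of individually arrayed clones."""
    return plates * wells


def coverage_probability(n_clones, insert_kb, genome_kb):
    """Clarke-Carbon estimate P = 1 - (1 - f)^N, where f = insert/genome."""
    fraction = insert_kb / genome_kb
    return 1.0 - (1.0 - fraction) ** n_clones


if __name__ == "__main__":
    n = clones_arrayed()
    p = coverage_probability(n, INSERT_KB, ASSUMED_GENOME_KB)
    print(f"{n} clones; estimated probability a locus is present: {p:.6f}")
```

Under these assumptions the estimate is very close to 1, which is consistent with the authors recovering multiple overlapping cosmids for the same region.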

S. caelestis PKS probe. A PKS-specific probe was generated by PCR 
amplification of S. caelestis genomic DNA, with degenerate primers designed from KS 
and AT sequences in the GenBank/EMBL database. The KS-specific 
oligonucleotide (5'-CGGTSAAGTCSAACATCGG-3') and the AT-specific 
oligonucleotide (5'-GCRATCTCRCCCTGCGARTG-3') were used in a PCR mixture 
containing ThermoPol reaction buffer (New England Biolabs, Beverly, Mass.), 
0.2 mM deoxynucleoside triphosphate mixture, 0.5 μg of genomic DNA, and 100 
pmol of the primers in a volume of 98 μl. After incubation at 96°C for 2 min, 2 
U of Vent DNA polymerase (New England Biolabs) was added, and the mixture 
was cycled 25 times in a Perkin-Elmer Cetus 9600 thermocycler at 96°C for 30 s, 
50°C for 2 min, and 72°C for 4 min, followed by a final incubation at 72°C for 15 
min. The reaction generated a 900-bp DNA fragment which was gel purified 
from a 1% agarose gel by using a Prep-A-Gene kit (Bio-Rad Laboratories, 
Hercules, Calif.). 
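
To make the term "degenerate primers" concrete, the sketch below expands the two oligonucleotides quoted above into the individual sequences they represent, using the standard IUPAC ambiguity codes (S = C or G, R = A or G). The function names are hypothetical helpers written for this illustration and are not part of any tool used in the article.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Primer sequences as printed in the text (5' to 3').
KS_PRIMER = "CGGTSAAGTCSAACATCGG"
AT_PRIMER = "GCRATCTCRCCCTGCGARTG"


def degeneracy(primer):
    """Number of distinct sequences encoded by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand_degenerate(primer):
    """List every concrete sequence encoded by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    for name, primer in (("KS", KS_PRIMER), ("AT", AT_PRIMER)):
        print(name, len(primer), "nt,", degeneracy(primer), "variants")
        for seq in expand_degenerate(primer):
            print("  ", seq)
```

The KS primer carries two S positions and the AT primer three R positions, so the pools are small (4 and 8 sequences), which keeps the effective concentration of each variant high in the PCR.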

Hybridization of the PKS probe. The S. caelestis library was transferred from 
96-well plates to Hybond-N membranes for hybridization. The membranes were 
soaked in prehybridization solution (27) for at least 2 h at 65°C. About 50 ng of 
the KS/AT probe was labeled with [α-32P]dCTP (DuPont, NEN Research 
Products), using the Megaprime DNA labeling system (Amersham) as instructed by 
the manufacturer. The radiolabeled probe was added to the prehybridization 
solution and incubated at 65°C for 16 to 20 h. The membranes were then washed 
twice in 1× SSC (1× SSC is 0.15 M NaCl plus 0.015 M sodium citrate)-0.1% 
sodium dodecyl sulfate at room temperature and once in 0.1× SSC-0.1% sodium 
dodecyl sulfate at 65°C for 20 min. The membranes were dried and placed on 35- 
by 43-cm phosphor screens (Molecular Dynamics, Sunnyvale, Calif.) for 2 to 4 h. 
Imaging of the screens was on a Molecular Dynamics PhosphorImager 425. 

DNA sequencing. Cosmids that hybridized to the KS/AT probe were digested 
with SstI, and the resulting fragments were subcloned into vector pGEM-3Zf or 
pUC19. SstI fragments that were larger than 8 kb were further digested with 
SmaI and subcloned to generate plasmids that would be more suitable for 
sequencing. Two methods were used for DNA sequencing. One method used 
[α-33P]dCTP (DuPont, NEN Research Products) and the fmol DNA cycle 
sequencing system (Promega). Samples were run on 6% polyacrylamide-8 M 
urea gels, using a Sequi-Gen II sequencing apparatus (Bio-Rad). The second 
method used an ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction 
kit (Perkin-Elmer Corporation), and the samples were run on 4.75% 
polyacrylamide-8.3 M urea gels on an Applied Biosystems 373 sequencer. All subclones 
were sequenced on both strands by using multicloning site primers to initiate the 
sequences and internal primers to extend the sequences. 

DNA sequence data was analyzed by using the Wisconsin sequence analysis 
package programs (Genetics Computer Group, Madison, Wis.) (8). Database 
searches to identify homologs to the deduced amino acid sequences were 
performed with BLAST (1). 



Conjugation. S. caelestis recipient cells were prepared by inoculating 30 ml of 
SGGP with 25 μl of spores and incubating the culture overnight at 30°C on a 
rotary shaker. The culture was then centrifuged at a relative centrifugal force of 
2,190 for 15 min. The pelleted cells were washed once in SGGP, recentrifuged, 
and resuspended in 2 ml of SGGP. E. coli donor cells, S17-1 and 
S17-1(pSK7595), were prepared by growing overnight cultures in 2 ml of LB (with 
apramycin [50 μg/ml] where appropriate) at 37°C on a rotary shaker. Two 
hundred microliters of the E. coli cultures was inoculated into 30 ml of fresh 
medium and grown for about 3 h (to mid-log phase) at 37°C. The cells were 
pelleted by centrifugation for 20 min and washed once in the same medium. 
After a final centrifugation, the pellets were resuspended in 300 μl of LB. 
Nucleopore membranes (25 mm, 0.2 μm; VWR) were placed onto an AS-1 agar 
plate and spotted with 50 μl each of donor and recipient cells. The plates were 
incubated at 30°C for 3 h, and then each membrane was placed into a 50-ml 
conical tube containing 5 ml of SGGP and vortexed for 30 s to dislodge the cells. 
The cells were centrifuged for 20 min, and the pellets were resuspended in 100 
μl of SGGP. Cells were then plated onto AS-1 plates and incubated overnight at 
30°C. The next day, the plates were overlaid with 2 ml of distilled H2O containing 
30 μl of a 50-mg/ml solution of apramycin and 150 μl of T7 bacteriophage (- 10 l " 
PFU) and returned to 30°C until transconjugants appeared (about 7 days). 

Metabolite analysis. Culture supernatants of cells grown for 3 days at 30°C in 
SCM medium were adjusted to pH 9.0 with NH4OH and extracted two times 
with equal volumes of ethyl acetate. The organic phases were pooled and 
concentrated. Extracts were evaluated on silica gel thin-layer chromatography 
(TLC) plates (60F-254; Merck), which were developed with isopropyl 
ether-methanol-NH4OH (75:35:2). Compounds were visualized by spraying the plates 
with anisaldehyde-sulfuric acid-ethanol (1:1:9) and heating the plates with a hot 
air gun until color developed. Biological activity was assessed by placing 
unstained TLC plates onto AM 11 agar plates seeded with Staphylococcus aureus as 
the indicator strain and incubating the plates overnight at 37°C to develop zones 
of inhibition. 

Nucleotide sequence accession number. The GenBank accession number for 
the niddamycin PKS and portions of two flanking genes is AF016585. 



RESULTS 

Cloning and sequencing of the niddamycin PKS cluster in S. 
caelestis. A strategy was devised to isolate the PKS genes of the 
niddamycin biosynthetic pathway by using sequence conservation 
in the KS and AT regions encoded by previously sequenced PKS 
genes (5, 10). A pair of degenerate primers spanning conserved 
regions was designed from sequences deposited in GenBank (see 
Materials and Methods) with the expectation of amplifying, from 
S. caelestis chromosomal DNA, most or all of the genes for the 
seven KS/AT regions predicted to be found in the niddamycin PKS. 
The product of the PCR ran as a 900-bp fragment on an agarose gel 
(data not shown) and was subsequently cloned into M13 for 
sequencing. Seven clones were analyzed, and all contained the same 
sequence, which was later found to correspond to the KS/AT 
region in module 5 of the niddamycin cluster (see below). This 
PCR product was used to probe chromosomal DNA to determine 
whether it had sufficient homology to hybridize to the 
remaining KS/AT domains of the niddamycin biosynthetic 
cluster and therefore could be used to identify cosmids in a 
library of S. caelestis DNA which encoded the remainder of the 
pathway. When DNA from S. caelestis was digested with SstI 
and probed with the PCR fragment, at least seven hybridizing 
bands were visible (data not shown). A second Southern 
hybridization, but with a 1.45-kb SstI-MscI fragment (coordinates 
10793 to 12244; GenBank accession number M63677) encoding 
part of the KS domain from module 5 of the erythromycin 
PKS, gave the same banding pattern, suggesting that these 
bands may represent all of the type I PKS genes present in S. 
caelestis. 

To construct the S. caelestis library, a partial SauIIIA digest 
of genomic DNA which had been size selected for 30-kb fragments 
was ligated to cosmid pNJ1 arms and propagated in E. coli 
XL1-Blue MR cells. The library of 2,880 clones was 
screened by colony blotting with the S. caelestis KS/AT probe 
described above. Nineteen clones gave a strong hybridization 
signal, and restriction digests of the positive clones were 
analyzed by Southern hybridization. Banding patterns of 
SstI-digested cosmid DNAs probed with the same DNA indicated 
that 16 of the 19 clones which were positive in colony blots 
were also positive by Southern analysis and that 15 of these 
clones contained overlapping inserts (Fig. 2). Three of the 
clones contained six hybridizing bands, while the other 12 
overlapping clones contained from two to five of the same 
hybridizing bands. Clone pCEL18h5, with six hybridizing 
bands, was chosen for sequence analysis. Two other clones, 
pCEL13f5 and pCEL13e12, not only shared a large hybridizing 
fragment (>10 kb) with pCEL18h5 but also contained a 
unique 2.4-kb fragment, indicating that these cosmids could 
contain PKS sequences not present in pCEL18h5. Another 
related clone, pCEL10a11, contained the 2.4-kb fragment but 
not the larger fragment, and so it was not analyzed further. The 
large SstI fragment was isolated from clones pCEL13f5 and 
pCEL18h5 and further digested with SmaI for Southern 
analysis. Probing indicated that both clones contained hybridizing 
bands of 2.1, 2.2, and 3.2 kb, but clone pCEL13f5 also 
contained a 1.9-kb hybridizing band not found in clone pCEL18h5. 
This was further evidence that clone pCEL13f5 shared DNA 
regions with clone pCEL18h5 but also contained neighboring 
PKS sequences. For this reason, clone pCEL13f5 was also 
analyzed along with clone pCEL18h5. 

FIG. 2. Southern analysis of positively hybridizing cosmids. Nineteen 
cosmids that hybridized with the KS/AT probe were digested with SstI and analyzed 
by Southern hybridization using a KS/AT probe generated from S. caelestis DNA. 
Three of the cosmids did not cross-react (i.e., were false positives). Lanes 
containing cosmid DNA specifically referred to in the text are labeled. 

Restriction fragments from cosmids pCEL18h5 and 
pCEL13f5 (Fig. 3) were subcloned into pUC19 or pGEM-3Zf 
and sequenced as described in Materials and Methods. Orien- 
tation of the subcloned fragments was determined by using the 
parent cosmid as the template for primers that annealed to the 
5' and 3' ends of each insert, generating upstream and down- 
stream sequences. These could then be aligned with sequences 
of the individual subclones to order them within the cosmid. 
Each cosmid was found to contain approximately 30 kb of 
insert DNA, with around 10 kb of overlapping sequence. In all, 
41.1 kb of DNA was sequenced. 

Organization of the niddamycin gene cluster. DNA sequence 
data obtained from cosmids pCEL18h5 and pCEL13f5 
were analyzed for open reading frames (ORFs) by using a 
Streptomyces codon usage table (33). Five large ORFs (ORFs 
1 to 5) spanning 40,012 bp of DNA were revealed (see below). 
These large ORFs were bounded by smaller, partially sequenced 
ORFs (ORFs 6 and 7) located at the 5' and 3' ends of 
ORFs 1 to 5, respectively (Fig. 3). The deduced N-terminal 
132-amino-acid sequence encoded by ORF 6 was found, by 
BLAST search, to have 61% identity to the N-terminal region 
of the product of the tylI gene, which encodes a cytochrome 
P450 hydroxylase in the biosynthetic pathway of the 16-membered 
macrolide tylosin (19). Like the gene containing ORF 6, 
tylI in Streptomyces fradiae is also located immediately 5' to, 
and reads in the opposite direction from, the genes encoding 
the PKS for tylosin production. The deduced N-terminal 
sequence encoded by ORF 7 (157 amino acids) was found to 


[Figure: map of modules 1 to 7 and their enzymatic domains, ORF6, ORF1 (nidA1) 
to ORF5 (nidA5), ORF7, SstI and SmaI fragment sizes (kb), and cosmids pCEL18h5 
and pCEL13f5.] 

FIG. 3. Organization of the niddamycin PKS genes. Cosmids pCEL18h5 and pCEL13f5 were isolated from a genomic library of S. caelestis DNA. DNA sequence 
was obtained from subcloned restriction fragments indicated by narrow lines and from cosmid DNA to link the restriction fragments. ORFs, enzymatic domains, and 
modules were identified from DNA sequence analysis. Domains labeled with asterisks are probably inactive. 



TABLE 1. Niddamycin PKS sequence coordinates 

ORF   Nucleotide       No. of         Homology 
no.   coordinates      amino acids 

6     1-396            132            P450 (N-terminal portion) 
1     627-13646        4340           PKS (loading domain, modules 1 and 2) 
      690-1904         405            KS* (inactive) 
      2235-3242        336            ATs 
      3312-3572        87             ACPs 
      3645-4916        424            KS 
      5214-6215        334            AT 
      6900-7442        181            KR 
      7695-7955        87             ACP 
      8025-9296        424            KS 
      9594-10595       334            AT 
      10650-11225      192            DH 
      12219-12770      184            KR 
      13068-13325      86             ACP 
2     13693-19209      1839           PKS (module 3) 
      13795-15066      424            KS 
      15379-16380      334            AT 
      16423-16992      190            DH 
      [illegible]      [illegible]    KR 
      18676-18933      86             ACP 
3     19269-30230      3654           PKS (modules 4 and 5) 
      19374-20645      424            KS 
      20973-22007      345            AT 
      22530-23057      176            KR* (inactive) 
      23322-23579      86             ACP 
      23646-24926      427            KS 
      25293-26315      341            AT 
      26355-26909      185            DH 
      27789-28835      349            ER 
      28842-29393      184            KR 
      29679-29936      86             ACP 
4     30294-35000      1569           PKS (module 6) 
      30423-31691      423            KS 
      32013-33038      342            AT 
      33786-34337      184            KR 
      34593-34853      87             ACP 
5     35000-40636      1879           PKS (module 7, TE) 
      35120-36388      423            KS 
      36692-37723      344            AT 
      38633-39184      184            KR 
      39455-39712      86             ACP 
      39824-40636      271            TE 
7     40626-41097      157            N-Methyltransferase (N-terminal portion) 



have 68% identity to the product of ORF 3*, an N-methyltransferase 
of the (yMfJ -encoded region of the tylosin biosynthetic 
cluster (12). Like ORF 7, ORF 3* is downstream of and 
reads in the same direction as the genes encoding PKS; 
however, it is not immediately adjacent to the genes encoding PKS. 
Based on sequence alignments and distance from potential 
ribosome binding sites (RBSs) (30), ORFs 6 and 7 appear to 
use ATG as the start codon (Table 1), with the putative RBSs 
being GGAGA and GGAGG, respectively. In addition, ORF 7 
may be translationally coupled to ORF 5, since the putative 
ATG start codon of ORF 7 begins 11 bases upstream of the 
TGA stop codon of ORF 5. 
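
The coordinates in Table 1 can be cross-checked against the reported amino acid counts, and the ORF 5/ORF 7 overlap can be recomputed from them. The sketch below does both for the ORF-level rows that are legible in this copy. It assumes that each coordinate range covers the sense codons only (that is, the stop codon is excluded); that convention is inferred from the arithmetic and is not stated in the article.

```python
# (label, first nucleotide, last nucleotide, amino acids) as listed in Table 1.
# Only rows that are cleanly legible in this copy are included.
ORF_ROWS = [
    ("ORF 6", 1, 396, 132),
    ("ORF 2", 13693, 19209, 1839),
    ("ORF 3", 19269, 30230, 3654),
    ("ORF 4", 30294, 35000, 1569),
    ("ORF 5", 35000, 40636, 1879),
]


def codons_spanned(start, end):
    """Whole codons in an inclusive coordinate range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"range {start}-{end} is not a multiple of 3")
    return length // 3


def check_rows(rows):
    """Map each label to True if the range length matches the residue count."""
    return {label: codons_spanned(s, e) == aa for label, s, e, aa in rows}


def bases_upstream_of_stop(upstream_last_sense_base, downstream_start):
    """Distance from a downstream start codon to the upstream ORF's stop codon.

    Assumes the upstream coordinate is the last base of the final sense codon,
    so the stop codon begins at the next base.
    """
    stop_first_base = upstream_last_sense_base + 1
    return stop_first_base - downstream_start


if __name__ == "__main__":
    for label, ok in check_rows(ORF_ROWS).items():
        print(label, "consistent" if ok else "MISMATCH")
    print("ORF 7 start lies", bases_upstream_of_stop(40636, 40626),
          "bases upstream of the ORF 5 stop codon")
```

With that convention, every listed range divides evenly into the stated number of residues, and the ORF 7 start at 40626 falls 11 bases before the stop codon that follows position 40636, matching the translational-coupling statement above.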

The larger ORFs (ORFs 1 to 5) were found to encode a PKS 
comprising proteins with molecular masses of 458, 194, 379, 
167, and 198 kDa, respectively. The genes corresponding to 
these ORFs were designated nidA1 through nidA5, respectively. 
nidA1, nidA2, and nidA5 appear to begin with ATG 
codons (Table 1) and have the putative RBSs GGAGG, 
GGGGG, and GGAG, respectively. nidA3 and nidA4 appear 
to begin with GTG codons, with the RBSs being GGAGA and 
GGAAG, respectively. The enzymatic motifs encoded by these 
genes (i.e., KS, AT, DH, ER, KR, and ACP) were delineated 
by BLAST searches which identified homology to other PKS 
genes deposited in the EMBL, GenBank, and Swissprot 
databases. As is characteristic of type I PKSs, the motifs within 
each ORF product are organized into modules. Seven modules, 
corresponding to the seven condensation reactions predicted 
to be required for niddamycin biosynthesis, were found 
in the PKS; NidA1 and NidA3 contain two modules each, 
whereas NidA2, NidA4, and NidA5 each contain one module 
(Fig. 3). In addition, the sets of motifs contained within each 
module were also consistent with the extent of β-carbon 
processing at the corresponding positions of the niddamycin ring 
(Fig. 1 and 3). Besides the KS, AT, and ACP domains found in 
all modules, modules 2 and 3 have the KR and DH domains 
required for the formation of the double bonds found between 
carbons 12 and 13 and carbons 10 and 11, respectively. Module 
5 has a full complement of domains (KS, AT, DH, ER, KR, 
and ACP) to produce the methylene group at C-7. Modules 4, 
6, and 7 each have KR domains which would result in hydroxyl 
groups at C-9, C-5, and C-3, respectively. However, sequence 
analysis of the KR in module 4 indicates that it is inactive (see 
below), resulting in the keto group at C-9. A TE domain with 
the conserved GxSxG and GdH motifs (10) was found at the 
carboxy-terminal end of NidA5 (module 7), identifying the end 
of the PKS cluster. 
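
The relationship described above between a module's reductive domains and the functional group left at the β-carbon can be written as a small lookup. This is a sketch of the reasoning in the paragraph, not a general PKS predictor: the domain sets for modules 2 through 7 are taken from the text, and the rule itself (no active KR gives a keto group, KR alone a hydroxyl, KR plus DH a double bond, KR plus DH plus ER a methylene) is the standard textbook reading that the paragraph applies.

```python
# Domain complement of modules 2-7 as described in the text.
MODULE_DOMAINS = {
    2: {"KS", "AT", "DH", "KR", "ACP"},
    3: {"KS", "AT", "DH", "KR", "ACP"},
    4: {"KS", "AT", "KR", "ACP"},
    5: {"KS", "AT", "DH", "ER", "KR", "ACP"},
    6: {"KS", "AT", "KR", "ACP"},
    7: {"KS", "AT", "KR", "ACP"},
}

# Domains present in the sequence but predicted to be inactive.
INACTIVE = {(4, "KR")}


def beta_carbon_state(module):
    """Predicted functional group at the beta-carbon after this module acts."""
    active = {d for d in MODULE_DOMAINS[module] if (module, d) not in INACTIVE}
    if "KR" not in active:
        return "keto"
    if "DH" not in active:
        return "hydroxyl"
    if "ER" not in active:
        return "double bond"
    return "methylene"


if __name__ == "__main__":
    for module in sorted(MODULE_DOMAINS):
        print(f"module {module}: {beta_carbon_state(module)}")
```

Marking the module 4 KR as inactive is what turns the expected hydroxyl at C-9 into the keto group observed in niddamycin.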

In contrast to the erythromycin PKS, where the N-terminal 
enzymatic motifs are a loading AT and ACP, the loading AT 
and ACP of the niddamycin PKS follow a KS domain which is 
not predicted to be necessary for the initiation of polyketide 
synthesis. Homology of this motif with the other KS domains in 
the cluster ranges from only 45 to 47% identity, whereas ho- 
mologies among the KS domains of modules 1 to 7 range from 
70 to 92% identity. More importantly, sequence analysis indi- 
cates that this KS is enzymatically nonfunctional, as a critical 
cysteine residue in the motif TVDTGCSSSLV, which is highly 
conserved among KS domains (3, 10), is replaced by a glutamine 
residue at position 173, forming a KSQ domain (Fig. 4). 
Interestingly, it has recently been reported that KSQ domains 
are also found at the N termini of the PKSs which synthesize 
the other 16-membered macrolides, carbomycin, spiramycin, 
and tylosin (17). The function of this motif, if any, remains to 
be determined. 

All of the remaining seven KS domains of this cluster retain 
the conserved active-site cysteine residue. In addition, two 
highly conserved His 308/347 residues (3, 11) are present in all of 
the niddamycin KS domains, including the unusual starter KS 
(Fig. 4). 

The niddamycin AT domains show more sequence variabil- 
ity than the KS domains. For example, AT2 and AT6 only have 
30% amino acid identity, whereas AT2 and AT3 have 95% 
amino acid identity. It has been demonstrated that mATs and 
mmATs fall into distinct families based on amino acid se- 
quence and can be distinguished by conserved motifs (28). As 
a result, it may be possible to predict AT substrate specificity 
on the basis of primary sequence. A PILEUP/DENDROGRAM
analysis comparing the niddamycin ATs with those
from the rapamycin and erythromycin PKS clusters is shown in 
Fig. 5. This analysis demonstrates that the loading AT, as well 
as those of modules 1, 2, 3, and 7, fall into the group of ATs 



Vol. 179, 1997 



NIDDAMYCIN POLYKETIDE SYNTHASE GENES 7519 



[Fig. 4 alignment panels: KS (NidKS1 to NidKS7 and NidKSs), DH (NidDH2,
NidDH5, NidDH3), KR (NidKR1 to NidKR7), ER (NidER5, EryER4), ACP
(NidACP1 to NidACP7 and NidACPs), and TE (NidTE, EryTE). The
residue-level alignment is not recoverable from the scan.]

FIG. 4. Conserved active sites of the niddamycin PKS domains. Deduced
protein sequences were aligned by using the PILEUP/PRETTY program. Only
the regions containing the proposed active sites are shown. The numbers
correspond to the amino acid position from the first residue in the first
motif listed. The boldfaced amino acids are conserved residues or motifs
important for the function of each domain. Active-site residues are marked
with asterisks. Motifs important for functioning of the KR and ER domains
are underlined. For comparative purposes, the erythromycin ER from
module 4 and the erythromycin TE are included.




[Fig. 5 dendrogram: AT domains grouped into a methylmalonyl AT class and
a malonyl AT class. The tree itself is not recoverable from the scan.]


FIG. 5. Dendrogram analysis of AT domains. The dendrogram shows the 
sequence similarity of AT domains from the niddamycin, erythromycin, and 
rapamycin PKS clusters. The niddamycin AT domains are boldfaced. The ATs 
are grouped according to the substrate specificities of the rapamycin and eryth- 
romycin ATs (malonyl-CoA or methylmalonyl-CoA). 



that the AT in module 6 of the niddamycin pathway uses the 
same or a similar substrate. 

A closer examination of the amino acid sequence (Fig. 6) of 
the niddamycin ATs shows that the active-site residues Gln 6,
Ser 91 (of the GHSXG motif where Ser is involved in the formation
of the acyl-enzyme intermediate), Arg 116, and His 192



[Fig. 6 alignment, malonyl AT class (NidAT1, NidAT3, NidAT2, NidATs,
NidAT7), positions 1 to 89: residues not recoverable from the scan.]


that use malonyl-CoA as the substrate. Based on the structure 
of niddamycin (Fig. 1), the acetate-derived methyl group at 
C-15 and the lack of methyl groups at C-14, C-12, C-10, and 
C-2 are consistent with those ATs being malonyl-CoA specific. 
On the other hand, the ATs of modules 4, 5, and 6 fall into the 
methylmalonyl-specific group. The methyl group at C-8 is con- 
sistent with AT4 being an mmAT. AT5, however, presumably 
uses ethylmalonyl-CoA as a substrate, resulting in the ethyl 
side chain at C-6. The fact that it does not segregate into a 
unique group in the dendrogram indicates that methylmalonyl 
and ethylmalonyl ATs may be relatively similar. The AT do- 
main in module 6 also falls into the methylmalonyl family, but 
its activity results in a hydroxyl or perhaps a methoxy group at 
C-4. The biosynthesis of leucomycin, a 16-membered macro- 
lide which contains a methoxy group at C-4, was examined by 
feeding the producing organism, Streptoverticillium kitasatoensis,
2-13C-labeled precursors (23, 24). Labeled malonate was
not incorporated at carbons 3 and 4, indicating that malonyl- 
CoA was not a substrate for the corresponding AT. Labeled 
glycerol, however, was incorporated at C-4, suggesting that 
glycolate may be the substrate for the AT of the module cor- 
responding to the C-4 position of leucomycin. It is conceivable 



[Fig. 6 alignment, continued: malonyl AT class, positions 113 to 187, and
methylmalonyl AT class (NidAT5, NidAT6, NidAT4), positions 1 to 190.
Residues not recoverable from the scan.]

FIG. 6. Alignment of the sequences of conserved motifs found in AT
domains. Sequences were aligned by using the PILEUP/PRETTY program.
Boldfaced amino acids are conserved residues or motifs important for the
function of AT domains. Active-site residues are marked with asterisks.
Proposed consensus sequences for motifs of ATs specific for acetate and
propionate (13) are indicated.



7520 KAKAVAS ET AL. 



J. Bacteriol. 



are present in all mATs and mmATs (3, 10). In addition, motifs 
which have been proposed to distinguish mATs from mmATs 
(13) seem to be more divergent in the niddamycin cluster (Fig. 
6, residues 58 to 77); however, these same stretches of amino 
acids, though they have different sequences, still seem to be 
specific for mATs and mmATs within the niddamycin se- 
quence. 

KR domains were found in all seven modules. Homologies
between the KR domains ranged from 31% amino acid identity
between KR4 and KR6 to 99% amino acid identity between
KR6 and KR7. It has been observed that modules corresponding
to unreduced β-carbonyls in the polyketide chains of erythromycin
(9), rapamycin (3), and spiramycin (17) still contain
KR motifs with, in most cases, obvious inactivating mutations. 
An inactive KR in module 4 would be consistent with the 
presence of the keto group at C-9 of niddamycin and would be 
another example of this type of organization. The consensus 
NADP(H) binding site of a KR is GxGxxGxxxA (29). The 
corresponding region of the KR4 domain of the niddamycin 
cluster is DxTxxPxxxL (Fig. 4), indicating that the KR is non- 
functional. All of the other KR domains in the cluster contain 
the expected NADP(H) binding motif with the exception of 
KR2. This KR is predicted to be active; however, a Cys residue 
is present at the first position of the NADP(H) binding site 
rather than a Gly. How this amino acid change would affect KR 
activity is uncertain. 
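The comparison with the NADP(H) binding consensus can be expressed as a position-by-position check against GxGxxGxxxA, where x matches any residue. The sketch below is illustrative only: the function name is an assumption, and the three test windows are hypothetical peptides built to follow the patterns quoted in the text (the consensus itself, DxTxxPxxxL for KR4, and a Cys at the first position for KR2), not the actual niddamycin sequences.

```python
CONSENSUS = "GxGxxGxxxA"  # NADP(H) binding site; "x" matches any residue


def consensus_mismatches(window, consensus=CONSENSUS):
    """List (position, expected, found) for each conserved position that differs.

    Positions are 1-based. The window must be as long as the consensus.
    """
    if len(window) != len(consensus):
        raise ValueError("window and consensus must have the same length")
    return [
        (index + 1, expected, found)
        for index, (expected, found) in enumerate(zip(consensus, window.upper()))
        if expected != "x" and expected != found
    ]


# Hypothetical windows; only the conserved positions follow the text.
print(consensus_mismatches("GAGAAGAAAA"))  # matches the consensus
print(consensus_mismatches("DATAAPAAAL"))  # KR4-like pattern DxTxxPxxxL
print(consensus_mismatches("CAGAAGAAAA"))  # KR2-like: Cys at position 1
```

Reporting the individual mismatches, rather than a single pass or fail result, makes it easy to separate a domain that has lost every conserved position from one with a single substitution whose effect is uncertain.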

DH domains were identified in modules 2, 3, and 5. Hydroxyl
groups formed by KRs at C-13 and C-11 are reduced by
DHs to form the double bonds between carbons 12 and 13 and 
carbons 10 and 11, corresponding to modules 2 and 3, respec- 
tively. Formation of the methylene group at C-7 (correspond- 
ing to module 5) also requires DH activity. DH2 and DH3 have 
39% amino acid identity, DH2 and DH5 have 45% identity, 
and DH3 and DH5 have 38% identity. The highly conserved 
residues His 45, Gly 49, and Pro 54 (5, 10) were present in DH2
and DH5, but, surprisingly, in DH3 there is an Asp in place of 
the Gly residue (Fig. 4). Based on the niddamycin structure, 
this DH domain should be active, and so this divergence from 
consensus may not adversely affect the activity of the enzyme. 
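Pairwise identity values such as those quoted for the DH domains are computed from aligned sequences. The exact convention used by the alignment program is not stated here, so the following sketch assumes one common definition: identical positions divided by the number of aligned columns in which neither sequence has a gap. The function name and the toy sequences are illustrative.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns where neither sequence has a gap.

    Assumes both inputs come from the same alignment and therefore have
    equal length. Other conventions (for example, dividing by the length
    of the shorter sequence) give different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if a != gap and b != gap
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


# Toy aligned peptides, not taken from the niddamycin cluster.
print(percent_identity("AAAA", "AATT"))
print(percent_identity("AC-EF", "ACDEF"))
```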

The only ER domain present in the PKS cluster is in module 
5. An amino acid sequence comparison of ER5 with ER do- 
mains from the products of the erythromycin and rapamycin 
PKS genes (Fig. 4) reveals the presence of the putative 
NADP(H) binding motif LxHxg(a)xGGVG (2, 29, 32). 

ACP domains were identified in each of the modules and 
after the loading AT, as expected. Homologies between the 
ACP domains range from 32 to 67% amino acid identity. The 
pantetheine-binding Ser 46 residue (10) in the GFDSL motif
was present in all of the ACP domains (Fig. 4). 

Gene disruption of the niddamycin PKS cluster. A gene 
disruption experiment was performed with S. caelestis to dem- 
onstrate that the PKS genes isolated from the genomic library 
were involved in niddamycin biosynthesis. A 1.5-kb SstI fragment
containing AT- and DH-encoding sequences from the
coding regions for module 3 (Fig. 2) was isolated from cosmid
pCEL18h5, treated with Klenow fragment, and cloned into the
EcoRV site of pKC1139 to generate pSK7595 (Fig. 7). This
plasmid contains an apramycin resistance gene for selection,
oriT for conjugal transfer, and both E. coli and Streptomyces
origins of replication. The Streptomyces origin of replication from 
Streptomyces ghanaensis (21) functions only at temperatures 
below 34°C, and so under selective pressure at a nonpermissive 
temperature, the plasmid must integrate into the chromosome 
through homologous recombination to be maintained. Plasmid 
pSK7595 was first transformed into E. coli S17-1, the donor 
strain carrying the conjugation locus RP4. To transfer the 




[Fig. 7 diagram: a 1.5-kb fragment carrying DH3 sequence is treated with
Klenow fragment and ligated into EcoRV-digested vector. Restriction-site
labels are not reliably recoverable from the scan.]

FIG. 7. Construction of pSK7595. The vector pKC1139 was used to construct
the conjugation vector pSK7595. The oriT gene allows for conjugal transfer
from E. coli strains containing the RP4 locus. The apramycin resistance
gene (Ap^r) allows for selection of the plasmid. The temperature-sensitive
Streptomyces origin of replication (rep^ts) and the E. coli origin of
replication (rep^pUC) are shown. The 1.5-kb SstI fragment containing AT-
and DH-encoding sequences was cloned into the EcoRV site of pKC1139
(black bar).



plasmid to the niddamycin-producing strain, S17-1 transformants
were incubated with S. caelestis cells as described in
Materials and Methods. As a negative control, S17-1 cells without
plasmid were also incubated with S. caelestis cells. The
cultures were challenged with apramycin to eliminate
non-plasmid-containing S. caelestis cells. Nalidixic acid (50 µg/ml) is
routinely used in Streptomyces-E. coli conjugation experiments
to eliminate the E. coli donor cells; however, S. caelestis was
found to be sensitive to this antibiotic. Therefore, the conjugation
cultures were challenged with bacteriophage T7 in order
to remove the E. coli cells. Cultures were incubated for 7
days at 30°C, and in each of two separate experiments, only one
S. caelestis colony was recovered.

One putative transconjugant, S. caelestis(pSK7595), was
grown at 37°C in apramycin-containing medium to force inte- 
gration of the temperature-sensitive plasmid. Southern analy- 
sis of genomic DNA indicated that the plasmid was present 
and had integrated into the appropriate location in the chromosome
(Fig. 8). This strain was designated S. caelestis nid::SK7595.
Surprisingly, the original transconjugant isolated at
30°C and never grown at a nonpermissive temperature also 
showed the pattern expected for integrated plasmid, indicating 
that this plasmid was not replicating at the permissive temper- 
ature in S. caelestis. This may also explain the extremely low 






[Fig. 8 image: panel A, diagram of the chromosome and the integration
event; panel B, lanes 1 and 2 with size markers at 5.5, 2.6, and 0.8 kb.]

FIG. 8. Disruption of the niddamycin PKS locus. (A) Diagrammatic
representation of the integration of pSK7595 into the S. caelestis
chromosome. The region of homology between the plasmid and chromosome is
represented by an open box. pKC1139 DNA is represented by a heavy line.
SstI sites are indicated. (B) Southern analysis of SstI-digested genomic
DNA probed with vector pKC1139. Lane 1, S. caelestis NRRL-2821 (wild
type); lane 2, S. caelestis(pSK7595). DNA fragment sizes based on
migration of a 1-kb ladder (BRL) on the same gel are indicated at the right.



frequency of conjugation since transconjugants could be se- 
lected only if a rare integration event took place before the 
plasmid was lost. 

Spores of S. caelestis nid::SK7595 were plated onto R3M
medium containing apramycin to generate single colonies.
Three of the isolates were examined for the production of
niddamycin. Cells were grown in apramycin-containing medium
at 30°C (data not shown) and 37°C for 4 days, and
supernatants were examined by TLC and TLC-bioautography
(Fig. 9). None of the S. caelestis nid::SK7595 isolates, grown at
either temperature, produced a spot on the TLC plate at the Rf
of niddamycin. In addition, no bioactivity was observed in the




[Fig. 9 image: TLC panels with lanes for the niddamycin standard (nid std)
and the pSK7595 isolates; the niddamycin position (nid) is marked.]

FIG. 9. Niddamycin production in S. caelestis(pSK7595). (A) TLC analysis
of ethyl acetate extractions of culture supernatants. Lanes: wt (wild
type), S. caelestis NRRL-2821; 1 to 3, isolates of S. caelestis(pSK7595);
nid std (niddamycin standard), 10 µg of niddamycin. The Rf of niddamycin
(nid) is indicated by the arrow. (B) TLC-bioautography analysis of ethyl
acetate extractions of culture supernatants. The lane assignments and
niddamycin location are identified as in panel A. Dark spots are zones of
inhibition of S. aureus due to the presence of niddamycin.



TLC-bioautography assay, indicating that the niddamycin PKS 
cluster had been disrupted and that the cloned genes were 
from the niddamycin pathway. 

DISCUSSION 

We have reported the isolation and characterization of the 
PKS genes involved in the biosynthesis of the macrolide anti- 
biotic niddamycin. The initial approach to the isolation of this 
cluster was to PCR amplify AT regions based on amino acid 
sequences which are found to be highly conserved among 
PKSs. We generated primers corresponding to regions in the 
KS and AT domains with the expectation of amplifying DNA 
corresponding to portions of all seven domains hypothesized to 
be present in the cluster. (The subsequent discovery of a KS Q
domain preceding the loading AT brings the number of potentially
amplifiable regions to eight.) The PCR resulted in the
coding region for only one AT, that from module 5, being 
amplified. On reexamination of the homology of the niddamy- 
cin nucleotide sequences to the primers, it was found that there 
was only one mismatch near the 5' end of the KS primer 
among the eight niddamycin KS nucleotide sequences but a 
much higher frequency of mismatches at the 3' end of the AT 
primer. For example, there were six mismatches each for five 
of the niddamycin AT nucleotide sequences, three mismatches 
for one and one for another. The coding region for module 5, 
which was successfully amplified, had no mismatches. There- 
fore, this PCR approach proved fruitful for generating a nid- 
damycin-specific probe to isolate the remaining coding regions 
of the cluster but not for simultaneously amplifying the coding 
regions for multiple domains. 
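The primer reexamination described above comes down to counting mismatches between a primer and each candidate binding site and noting how close they lie to the 3' end, where mismatches are most likely to block extension. A minimal sketch follows; the function names, the window size of six bases, and the toy sequences are assumptions for illustration, and degenerate primer positions are not handled.

```python
def mismatch_positions(primer, site):
    """Return 1-based positions (counted from the 5' end) where primer and site differ.

    Both strings are written 5' to 3' on the same strand and must have
    the same length.
    """
    if len(primer) != len(site):
        raise ValueError("primer and site must have the same length")
    return [
        index + 1
        for index, (p, s) in enumerate(zip(primer.upper(), site.upper()))
        if p != s
    ]


def mismatches_near_3prime(primer, site, window=6):
    """Count mismatches that fall within the last `window` bases of the primer."""
    cutoff = len(primer) - window
    return sum(1 for pos in mismatch_positions(primer, site) if pos > cutoff)


# Toy 12-mer primer and two hypothetical binding sites.
PRIMER = "ACGTACGTACGT"
print(mismatch_positions(PRIMER, "TCGTACGTACGT"), mismatches_near_3prime(PRIMER, "TCGTACGTACGT"))
print(mismatch_positions(PRIMER, "ACGTACGTACGA"), mismatches_near_3prime(PRIMER, "ACGTACGTACGA"))
```

Keeping the position list separate from the 3' count mirrors the distinction drawn in the text between the single mismatch near the 5' end of the KS primer and the more frequent mismatches at the 3' end of the AT primer.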

Probing the S. caelestis genome with either the PCR-generated
probe or a probe from a conserved region of the erythromycin
PKS generated the same hybridization patterns, indicating
that these were the only type I PKS sequences present
in the strain. It has been observed that some strains of Strep- 
tomyces, such as Streptomyces hygroscopicus 29253, contain 
multiple PKS clusters (18, 26), which can complicate isolation 
of a particular cluster by hybridization. This does not appear to 
be the case in S. caelestis. When the PCR-generated probe was 
used for Southern analysis of SstI-digested S. caelestis DNA,
we identified seven cross-hybridizing bands which corresponded
to seven of the SstI fragments encoding the niddamycin
PKS.

Sequence analysis of the isolated DNA fragments revealed a 
PKS cluster comprising seven modules as expected for synthe- 
sis of niddamycin. The modules are encoded by five ORFs, 
with two modules each encoded by ORFs 1 and 3 and one each 
encoded by ORFs 2, 4, and 5. The organization of the enzy- 
matic domains contained within each module is also consistent 
with what would be expected for synthesis of the macrolide 
ring of niddamycin. An unexpected KS domain (KS Q) with a
glutamine replacing a conserved cysteine residue was found at 
the amino terminus of the PKS. The function of this domain, if 
any, remains unclear. Platenolide A is the polyketide backbone 
of the macrolide antibiotic spiramycin and is identical in struc- 
ture to the polyketide backbone of niddamycin. A genetic map 
of the spiramycin PKS was recently published (17), and the 
organization and domain content of the modules, including the 
unusual KS Q , are the same as those found in this study for the 
niddamycin cluster. The degree of genetic relatedness of these 
clusters awaits the release of the spiramycin PKS nucleotide 
sequence. 

Previous work has suggested that AT sequences cluster into 
families based on substrate specificity (13, 28). As predicted 
from the structure of niddamycin, the AT domains of modules 






1, 2, 3, and 7 and the loading AT fall into the class of ATs 
which utilize malonate. The remaining ATs fall into the meth- 
ylmalonate class. The methyl group at C-8 is consistent with 
AT4 being an mmAT. AT5 and AT6, however, are predicted to
utilize, respectively, ethylmalonyl-CoA and a CoA derivative of 
unknown structure which would result, perhaps by further 
modification, in the methoxy group at C-4. It should also be 
noted that the loading AT for the erythromycin PKS also falls 
within the methylmalonate class even though the substrate for 
this AT is propionyl-CoA. Thus, it seems that the methylma- 
lonyl grouping may encompass ATs which recognize a broader 
range of substrates than previously predicted. As a result, it
may be difficult to assign substrate specificity solely on the
basis of homology with other members of the methylmalonyl class,
either for ATs not assigned to known PKS clusters or for ATs
whose modular positions within a known cluster have not been
determined.

Domain replacements within the erythromycin PKS have 
been shown to result in the production of novel bioactive 
compounds (11, 22, 25). The ethylmalonyl AT domain (AT5) 
and the methoxy AT domain (AT6) of the niddamycin cluster 
could conceivably be used to replace the mmAT domains in 
the erythromycin PKS to generate erythromycin derivatives 
with novel polyketide backbone structures that would be dif- 
ficult to produce by chemical methods. 

ACKNOWLEDGMENTS 

We thank R. Summers for KS/AT primers, M. Staver for assisting in 
genomic library construction, T. Kavanaugh and S. Nannapeneni for 
synthesis of oligonucleotides, and P. Youngman for insightful sugges- 
tions. 

REFERENCES 

1. Altschul, S. F., W. Gish, W. Miller, E. W. Myers, and D. J. Lipman. 1990.
Basic local alignment search tool. J. Mol. Biol. 215:403-410.

2. Amy, C. M., A. Witkowski, J. Naggert, B. Williams, Z. Randhawa, and S.
Smith. 1989. Molecular cloning and sequencing of cDNAs encoding the
entire rat fatty acid synthase. Proc. Natl. Acad. Sci. USA 86:3114-3118.

3. Aparicio, J. F., I. Molnar, T. Schwecke, A. Konig, S. F. Haydock, L. E. Khaw,
J. Staunton, and P. Leadlay. 1996. Organization of the biosynthetic gene
cluster for rapamycin in Streptomyces hygroscopicus: analysis of the enzymatic
domains in the modular polyketide synthase. Gene 169:9-16.

4. Baltz, R. H. 1980. Genetic recombination in Streptomyces fradiae by
protoplast fusion and cell regeneration. Dev. Ind. Microbiol. 21:43-54.

5. Bevitt, D. J., J. Cortes, S. F. Haydock, and P. F. Leadlay. 1992.
6-Deoxyerythronolide B synthase 2 from Saccharopolyspora erythraea. Cloning
of the structural gene, sequence analysis and inferred domain structure of the
multifunctional enzyme. Eur. J. Biochem. 204:38-49.

6. Bierman, M., R. Logan, K. O'Brien, E. T. Seno, R. N. Rao, and B. E.
Schoner. 1992. Plasmid cloning vectors for the conjugal transfer of DNA
from Escherichia coli to Streptomyces spp. Gene 116:43-49.

7. Cortes, J., S. F. Haydock, G. A. Roberts, D. J. Bevitt, and P. F. Leadlay. 1990.
An unusually large multifunctional polypeptide in the erythromycin-producing
polyketide synthase of Saccharopolyspora erythraea. Nature 348:176-178.

8. Devereux, J., P. Haeberli, and O. Smithies. 1984. A comprehensive set of
sequence analysis programs for the VAX. Nucleic Acids Res. 12:387-395.

9. Donadio, S., M. J. Staver, J. B. McAlpine, S. J. Swanson, and L. Katz. 1991.
Modular organization of genes required for complex polyketide biosynthesis.
Science 252:675-679.

10. Donadio, S., and L. Katz. 1992. Organization of the enzymatic domains in
the multifunctional polyketide synthase involved in erythromycin formation
in Saccharopolyspora erythraea. Gene 111:51-60.

11. Donadio, S., J. B. McAlpine, P. J. Sheldon, M. Jackson, and L. Katz. 1993.
An erythromycin analog produced by reprogramming polyketide synthesis.
Proc. Natl. Acad. Sci. USA 90:7119-7123.

11a. Fager, E. E., T. J. Oliver, J. F. Prokop, and A. C. Sinclair. November 1965.
U.S. patent 3218239.

12. Gandecha, A. R., S. L. Large, and E. Cundliffe. 1997. Analysis of four tylosin
biosynthetic genes from the tylLM region of the Streptomyces fradiae genome.
Gene 184:197-203.

13. Haydock, S., J. F. Aparicio, I. Molnar, T. Schwecke, A. Konig, A. F. A.
Marsden, I. S. Galloway, J. Staunton, and P. F. Leadlay. 1995. Divergent
structural motifs correlated with the substrate specificity of
(methyl)malonyl-CoA:acyl carrier protein transacylase domains in the modular
polyketide synthases. FEBS Lett. 374:246-248.

14. Hopwood, D. A., M. J. Bibb, K. F. Chater, T. Kieser, C. J. Bruton, H. M.
Kieser, D. J. Lydiate, C. P. Smith, J. M. Ward, and H. Schrempf. 1985.
Genetic manipulation of Streptomyces: a laboratory manual. The John Innes
Foundation, Norwich, England.

15. Hopwood, D. A., and D. H. Sherman. 1990. Molecular genetics of polyketides
and its comparison to fatty acid biosynthesis. Annu. Rev. Genet. 24:37-66.

16. Huber, V. G., K. H. Wallhausser, L. Fries, A. Steigler, and H.-L.
Weidenmuller. 1962. Niddamycin, ein neues makrolid-antibioticum. Arzneim.
Forsch. 12:1191-1195.

17. Kuhstoss, S., M. Huber, J. R. Turner, J. W. Paschal, and R. N. Rao. 1996.
Production of a novel polyketide through the construction of a hybrid
polyketide synthase. Gene 183:231-236.

18. Lomovskaya, N., L. Fonstein, X. Ruan, D. Stassi, L. Katz, and C. R.
Hutchinson. 1997. Gene disruption and replacement in the rapamycin-producing
Streptomyces hygroscopicus strain ATCC 29253. Microbiology 143:875-883.

19. Merson-Davies, L. A., and E. Cundliffe. 1994. Analysis of five tylosin
biosynthetic genes from the tylIBA region of the Streptomyces fradiae genome.
Mol. Microbiol. 13:349-355.

20. Molnar, I., J. F. Aparicio, S. F. Haydock, L. E. Khaw, T. Schwecke, and D.
O'Hagan. 1993. Biosynthesis of fatty acid and polyketide metabolites. Nat.
Prod. Rep. 10:593-624.

21. Muth, G., B. Nussbaumer, W. Wohlleben, and A. Puhler. 1989. A vector
system with temperature-sensitive replication for gene disruption and
mutational cloning in streptomycetes. Mol. Gen. Genet. 219:341-348.

22. Oliynyk, M., M. J. B. Brown, J. Cortes, J. Staunton, and P. F. Leadlay. 1996.
A hybrid modular polyketide synthase obtained by domain swapping. Chem.
Biol. 3:833-839.

23. Omura, S., A. Nakagawa, H. Takeshima, K. Atsumi, J. Miyazawa, F. Piriou,
and G. Lukacs. 1975. Biosynthetic studies using 13C enriched precursors on
the 16-membered macrolide antibiotic leucomycin A3. J. Am. Chem. Soc.
97:6600-6602.

24. Omura, S., K. Tsuzuki, A. Nakagawa, and G. Lukacs. 1983. Biosynthetic
origin of carbons 3 and 4 of leucomycin aglycone. J. Antibiot. 36:611-613.

25. Ruan, X., A. Pereda, D. L. Stassi, D. Zeidner, R. G. Summers, M. Jackson,
A. Shivakumar, S. Kakavas, M. J. Staver, S. Donadio, and L. Katz. 1997.
Acyltransferase domain substitutions in the erythromycin polyketide synthase
yield novel erythromycin derivatives. J. Bacteriol. 179:6416-6425.

26. Ruan, X., D. Stassi, S. Lax, and L. Katz. A second type-I PKS gene cluster
isolated from Streptomyces hygroscopicus ATCC 29253, a rapamycin-producing
strain. Gene, in press.

27. Sambrook, J., E. F. Fritsch, and T. Maniatis. 1989. Molecular cloning: a
laboratory manual, 2nd ed. Cold Spring Harbor Laboratory, Cold Spring
Harbor, N.Y.

28. Schwecke, T., J. F. Aparicio, I. Molnar, A. Konig, L. E. Khaw, S. F. Haydock,
M. Oliynyk, P. Caffrey, J. Cortes, J. B. Lester, G. A. Bohm, J. Staunton, and
P. F. Leadlay. 1995. The biosynthetic gene cluster for the polyketide
immunosuppressant rapamycin. Proc. Natl. Acad. Sci. USA 92:7839-7843.

29. Scrutton, N. S., A. Berry, and R. N. Perham. 1990. Redesign of the coenzyme
specificity of a dehydrogenase by protein engineering. Nature 343:38-42.

30. Strohl, W. R. 1992. Compilation and analysis of DNA sequences associated
with apparent streptomycete promoters. Nucleic Acids Res. 20:961-974.

31. Tuan, J. S., J. M. Weber, M. J. Staver, J. O. Leung, S. Donadio, and L. Katz.
1990. Cloning of genes involved in erythromycin biosynthesis from
Saccharopolyspora erythraea using a novel actinomycete-Escherichia coli cosmid.
Gene 90:21-29.

32. Witkowski, A., V. S. Rangan, Z. I. Randhawa, C. M. Amy, and S. Smith. 1991.
Structural organization of the multifunctional animal fatty-acid synthase.
Eur. J. Biochem. 198:571-579.

33. Wright, F., and M. J. Bibb. 1992. Codon usage in the G+C-rich Streptomyces
genome. Gene 113:55-65.

34. Yamamoto, H., K. H. Maurer, and C. R. Hutchinson. 1986. Transformation
of Streptomyces erythraeus. J. Antibiot. 39:1304-1313.

35. Yanisch-Perron, C., J. Vieira, and J. Messing. 1985. Improved M13 phage
cloning vectors and host strains: nucleotide sequences of the M13mp18 and
pUC19 vectors. Gene 33:103-119.



Proc. Natl. Acad. Sci. USA 

Vol. 95, pp. 7305-7309, June 1998 

Biochemistry 

Ethyl-substituted erythromycin derivatives produced by directed 
metabolic engineering 

D. L. STASSI*†, S. J. KAKAVAS†, K. A. REYNOLDS§, G. GUNAWARDANA†, S. SWANSON†, D. ZEIDNER†, M. JACKSON†,
H. LIU§, A. BUKO†, AND L. KATZ†

†Pharmaceutical Products Division, Abbott Laboratories, Abbott Park, IL 60064; and §Institute of Structural Biology and Drug Discovery, Department of
Medicinal Chemistry, Virginia Commonwealth University, Richmond, VA 23219

Edited by Arnold L. Demain, Massachusetts Institute of Technology, Cambridge, MA, and approved April 27, 1998 (received for review
February 26, 1998)



ABSTRACT A previously unknown chemical structure,
6-desmethyl-6-ethylerythromycin A (6-ethylErA), was produced
through directed genetic manipulation of the erythromycin
(Er)-producing organism Saccharopolyspora erythraea.
In an attempt to replace the methyl side chain at the C-6
position of the Er polyketide backbone with an ethyl moiety,
the methylmalonate-specific acyltransferase (AT) domain of
the Er polyketide synthase was replaced with an
ethylmalonate-specific AT domain from the polyketide synthase
involved in the synthesis of the 16-member macrolide niddamycin.
The genetically altered strain was found to produce ErA,
however, and not the ethyl-substituted derivative. When the
strain was provided with precursors of ethylmalonate, a small
quantity of a macrolide with the mass of 6-ethylErA was
produced in addition to ErA. Because substrate for the
heterologous AT seemed to be limiting, crotonyl-CoA reductase,
a primary metabolic enzyme involved in butyryl-CoA
production in streptomycetes, was expressed in the strain. The
primary macrolide produced by the reengineered strain was
6-ethylErA.



Erythromycin (Er) is a broad-spectrum macrolide antibiotic 
produced by Saccharopolyspora erythraea. The backbone of the 
molecule is a 14-member macrocyclic ring (Fig. 1) that is
produced through the sequential condensation of one molecule
of propionyl-CoA and six molecules of methylmalonyl-CoA
by a modular polyketide synthase (PKS). This enzyme
complex comprises three large multifunctional polypeptides, 
DEBS1, DEBS2, and DEBS3, each of which contains two 
modules, and, in the case of DEBS1, a loading domain to 
initiate synthesis of the polyketide chain (1, 2). Each module 
contains the enzymatic activities necessary for one condensa- 
tion and subsequent reduction of the extender to the growing 
chain. Selection of the appropriate extender unit is accom- 
plished by the acyltransferase (AT) domain present in each 
module (3). 

Recently, genetic engineering has joined synthetic chemistry 
in the production of novel macrolide structures. Hybrid PKSs 
have been constructed through the replacement of AT do- 
mains with those that specify different starter or extender units 
(3-6). These manipulations have resulted in the production of 
Er derivatives lacking methyl groups at C-10 and C-12 (5) and 
those incorporating the branched chain starter units of aver- 
mectin biosynthesis (6). To date, however, there have been no 
manipulations that allow the methyl side chains of Er to be 
replaced by ethyl groups. This type of modification would be 
useful in expanding the structural diversity of hybrid 
polyketides produced by combinatorial biosynthesis. 



The publication costs of this article were defrayed in part by page charge 
payment. This article must therefore be hereby marked "advertisement" in 
accordance with 18 U.S.C. §1734 solely to indicate this fact. 

© 1998 by The National Academy of Sciences 0027-8424/98/957305-5$2.00/0
PNAS is available online at http://www.pnas.org. 



Niddamycin (Nd) is a 16-member macrolide with an ethyl 
side chain at C-6 (Fig. 1). The module 5 AT, which is believed 
to be responsible for incorporation of the ethyl side chain into 
the polyketide backbone, was identified by sequence analysis 
of the Nd PKS genes (7). In this paper we describe the 
construction of a hybrid PKS through the replacement of a 
methylmalonyl-specific AT of the Er PKS with that of the 
ethylmalonyl-specific AT of the Nd PKS. This substitution 
alone, however, was not sufficient to produce an Er derivative 
with an ethyl side chain; additional manipulations of the 
carboxylic acid precursor pools were necessary to generate the 
desired compound. 

MATERIALS AND METHODS 

Strains, Plasmids, and Media. The wild-type Er producer is
S. erythraea ER720 (8). Plasmid pWHM3 (9) is an Escherichia
coli-Streptomyces shuttle vector that is maintained in S. erythraea
only when it contains heterologous DNA for chromosomal
integration. Plasmid pDPE81 is a derivative of pKAS37
(10) in which a 1.7-kb BglII fragment containing the hygromycin
resistance marker was inserted into the BglII site of
pKAS37. E. coli DH5α (Life Technologies, Gaithersburg, MD)
was the host used for plasmid construction and isolation.
Liquid cultures of S. erythraea strains were grown in SGGP (11)
for production of protoplasts and SCM (5) for metabolite or
enzyme analysis. Plate cultures of S. erythraea strains were
grown on R3M medium (5). Thiostrepton (Ts)-resistant strains
of S. erythraea were grown in 25 µg/ml and 10 µg/ml Ts for
plate and liquid cultures, respectively.

Plasmid pEAT4 (Fig. 2A) was constructed as follows. Cos- 
mid pAIBX85, a pWHM3 derivative containing DNA from 
modules 3 and 4 of the Er PKS (corresponding to nucleotides 
979-9349; GenBank accession no. M63677), was used to clone 
DNA flanking eryAT4. The 5' flanking region was isolated by 
digesting pAIBX85 with MscI and BstEII (nucleotides 4247- 
6033), treating with the Klenow fragment of DNA polymerase, 
and ligating the fragment into the SmaI site of pUC19 to 
generate pUC/5' flank. An AvrII site was engineered 13 bp 
downstream of the BstEII site by PCR amplification of a 
306-bp region of DNA from the PmlI site (nucleotide 5739) to 
12 bp 3' of the BstEII site (nucleotide 6045). The engineered 
AvrII site does not change the Pro-Arg residues encoded by this 
region (Fig. 2B). A BamHI site was also included on the PCR 
primer just downstream of the AvrII site. The resulting frag- 
ment was digested with PmlI and BamHI and cloned into the 
PmlI/BamHI site of pUC/5' flank, replacing the native se- 



This paper was submitted directly (Track II) to the Proceedings office. 
Abbreviations: 6-ethylErA, 6-desmethyl-6-ethylerythromycin A; AT, 
acyltransferase; Ccr, crotonyl-CoA reductase; Er, erythromycin; Nd, 
niddamycin; PKS, polyketide synthase; Ts, thiostrepton. 
*To whom reprint requests should be addressed at: Abbott Labora- 
tories, CAPD, D-4FR F3-3, 1401 Sheridan Road, North Chicago, IL 
60064. e-mail: diane.stassi@abbott.com. 






7306 Biochemistry: Stassi et al. 



Proc. Natl. Acad. Sci. USA 95 (1998) 



[Chemical structures: Erythromycin A (sugars: desosamine, cladinose); Niddamycin (sugars: mycaminose, mycarose; isobutyrate side group); 6-Desmethyl-6-ethylerythromycin A (sugars: desosamine, cladinose)] 

Fig. 1. Structures of Er, Nd, and 6-ethylErA. Shadowed letters represent relevant side groups. 



quence and resulting in vector pUC/5' flank/AvrII. The 3' 
flanking DNA was isolated by digesting pAIBX85 with PmlI 
and MscI (nucleotides 6999 and 8977), treating with the 
Klenow enzyme, and ligating the fragment into the SmaI site 
of pUC19, generating vector pUC/3' flank. nidAT5 was 
isolated by PCR amplification of cosmid 13f5 (7), which 
contains DNA spanning module 5 of the Nd PKS cluster. The 
5' oligonucleotide was designed to create an AvrII site 12 
nucleotides upstream of the beginning of the nidAT5 domain 
(nucleotide 25290; GenBank accession no. AF016585) and to 
place the amino acid sequence Pro-Arg-Lys-Pro in front of 
nidAT5 to correspond to the sequence that is found upstream 
of the eryAT4 domain. The 3' oligonucleotide generated an 
FseI site at the end of the nidAT5 domain (nucleotide 26284, 
GenBank accession no. AF016585), resulting in a conservative 
Val to Ala change. A BamHI site was also incorporated after 
the FseI site. The fragment was digested with AvrII and BamHI 
and ligated into AvrII/BamHI-digested pUC/5' flank/AvrII, 
creating vector pUC/5' flank/nidAT5. The 3' flanking DNA 
was then cloned onto the 3' end of nidAT5 by first digesting 
pUC/3' flank with FseI and BamHI, gel purifying the 1920-bp 



[Fig. 2A diagram: nidAT5 insert between eryA flanking DNA; coordinate labels 4247, 7045, and 8977] 

[Fig. 2B alignment] 

                        N-terminus                 C-terminus 
eryAT4 wild type        APVTSAPRKP VLVFPGQGAQ      ADLRPAVAGG RPAELPTYPF 
nidAT5 wild type        TGTAGTAQRV VFVFPGQGSQ      QPEAVFGPGA RPVELPTYPF 
eryAT4 flank/nidAT5     APVTSAPRKP VFVFPGQGSQ      QPEAVFGPGA RPAELPTYPF 

The AvrII site is marked in the N-terminal junction and the FseI site in the C-terminal junction. 

Fig. 2. (A) Diagrammatic representation of insert in vector 
pEAT4. DNA encoding the AT from module 5 of the Nd PKS cluster 
(nidAT5) was cloned between DNA flanking the 5' and 3' boundaries 
of the AT coding region in module 4 of the Er PKS cluster. The 
numbered boxes correspond to eryA sequence coordinates from 
GenBank accession no. M63676. (B) Amino acid comparisons in the 
junction regions of eryAT4, nidAT5, and the eryAT4 flank/nidAT5 
construct. The amino acids encoded at the AvrII and FseI sites are 
indicated. 



fragment, and ligating it into FseI/BamHI-digested pUC/5' 
flank/nidAT5, creating vector pUC/nidAT5/C6-flank. To 
generate pEAT4, the nidAT5/flanking DNA cassette was 
isolated from pUC/nidAT5/C6-flank by digestion with EcoRI 
and HindIII and then ligated to EcoRI/HindIII-digested 
pWHM3. 
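
Two details of the construction above can be checked with simple arithmetic and the standard genetic code: the length of the PCR product between the stated coordinates, and the claim that the engineered AvrII site leaves the Pro-Arg residues unchanged. The following is a minimal sketch, assuming that the AvrII recognition sequence (CCTAGG) lies in frame so that it is read as two codons; the function name and the two-entry codon table are illustrative and are not part of the original methods.

```python
# Only the two codons needed here, taken from the standard genetic code.
CODON_TABLE = {"CCT": "Pro", "AGG": "Arg"}

AVRII_SITE = "CCTAGG"  # AvrII recognition sequence

# Coordinates quoted in the text for the PCR-amplified region.
PMLI_SITE_NT = 5739
PCR_END_NT = 6045


def translate_in_frame(dna):
    """Translate a DNA string codon by codon using CODON_TABLE."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]


# Assumption: the site is read in frame as CCT-AGG.
residues = translate_in_frame(AVRII_SITE)
amplicon_bp = PCR_END_NT - PMLI_SITE_NT

print("AvrII site read in frame:", "-".join(residues))
print("PCR product length (bp):", amplicon_bp)
```

Under that reading-frame assumption the restriction site itself encodes Pro-Arg, which is consistent with a site that can be introduced without altering the protein sequence.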

Plasmid pDPE-ccr (Fig. 3) was constructed as follows. The 
Streptomyces collinus crotonyl-CoA reductase (Ccr) gene (ccr) 
was subcloned from plasmid pZYB3 (12) by digestion with 
XbaI and BamHI, which releases ccr and the upstream T7 
ribosomal binding site. This fragment was treated with the 
Klenow enzyme and ligated into the polylinker of pDPE81 that 
had been digested with EcoRI and treated with the Klenow 
enzyme. The polylinker of pDPE81 is in the center of a 10-kb 
fragment of S. erythraea chromosomal DNA, which directs 
integration into the chromosome at a site that does not seem 
to affect Er production. Plasmid pDPE-ccr was designed so 
that the ccr gene expressed from the ermE* promoter (13) and 
the Ts-resistance marker are left behind in the chromosome 
following a double-crossover event. 

Genetic Manipulations. Standard molecular biology tech- 
niques were performed as described (14). Enzymes and re- 
agents were purchased from Life Technologies. Protoplast 
transformation and marker replacement in S. erythraea were 
performed as described (5). One microgram of plasmid DNA 
was routinely used for protoplast transformations. Putative 
transformants were grown in SGGP medium containing 10 
µg/ml Ts to confirm resistance. For chromosomal eviction of 
pEAT4, transformants were passaged twice in SGGP medium 
without Ts, cells were plated for spores, and individual colo- 
nies arising from spores were screened for Ts sensitivity. 



[Plasmid map of pDPE-ccr; labeled restriction sites include BamHI and KpnI] 



Fig. 3. Plasmid pDPE-ccr. The thick black lines represent S. 
erythraea DNA, which allows for integration of the plasmid by homol- 
ogous recombination into the S. erythraea chromosome. Also indicated 
are genes for ampicillin (amp) and Ts resistance. 






Electroporation was performed as described (15) by using 1 
of pDPE-ccr DNA. Because the plasmid is unstable when 
integrated into the S. erythraea chromosome, two consecutive 
platings on Ts-containing R3M plates result in resolution and 
eviction of plasmid sequences. PCR and Southern hybridiza- 
tion were as described (7). 

Ccr Assay. Seed cultures of S. erythraea grown in SCM and 
S. collinus grown in medium A (16) were diluted 5-fold into 
SCM and medium A, respectively, and grown for 48 h at 30°C. 
Cells were harvested, opened with a French pressure cell, 
centrifuged to obtain cell-free extracts, and assayed spectro- 
photometrically for Ccr activity as described (16). One unit of 
Ccr activity is defined as 1 µmol of NADPH oxidized in 1 min. 

Isolation and Identification of Metabolites. Small-scale 
(milliliters) isolation of metabolites from S. erythraea, TLC, 
and MS analysis were performed as described (5). Large-scale 
isolation for compound identification was as follows. Fermen- 
tations were conducted in 42-liter LH fermentation series 2000 
stainless steel vessels (LH Fermentation, Maidenhead, UK). 
Each fermentor was charged with 30 liters of a medium 
consisting of 1.5% soluble starch, 2.2% soybean flour, 0.2% 
CaCO3, 0.15% brewer's yeast (Wind Gap Farms, Baconton, 
GA), 0.102% MgSO4·7H2O, 0.0027% FeSO4·7H2O, and 1% 
soybean oil. Sterilization was at 121°C and 15 psi (1 psi = 6.89 
kPa). A two-step seed protocol was used. Vegetative growth 
from a frozen stock of S. erythraea strain EAT4-ccr was 
inoculated at 1.5% into a 500-ml Erlenmeyer flask containing 
100 ml of the following medium: 1.5% glucose monohy- 
drate/1% soluble starch/1% Soytone (Difco)/0.9% yeast ex- 
tract (Difco)/0.21% Mops/0.001% Ts. For the second step, a 
2-liter Erlenmeyer flask containing 600 ml of the same me- 
dium was inoculated at 5% with the first-step growth. Both 
steps were incubated at 28°C on a rotary shaker at 225 rpm for 
48 h. The fermentor was inoculated at 5% with the second-step 
seed growth. Fermentor temperature was controlled at 32°C, 
agitation was 250 rpm, aeration was 1 vol·vol⁻¹·min⁻¹, and 
head pressure was 6 psi. Silicone antifoam was added at 0.01% 
initially and was available on demand. Harvest was at 108 h. 
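
The percentages and inoculation rates above translate directly into batch quantities. The sketch below works them out for the 30-liter fermentor charge and the two-step seed train. It assumes that every percentage is weight per volume (g per 100 ml), which the text does not state explicitly and which may not hold for the soybean oil; the function and variable names are illustrative.

```python
# Fermentation medium from the text, in percent.
# Assumption: all values are weight/volume (g per 100 ml).
MEDIUM_PERCENT = {
    "soluble starch": 1.5,
    "soybean flour": 2.2,
    "CaCO3": 0.2,
    "brewer's yeast": 0.15,
    "MgSO4.7H2O": 0.102,
    "FeSO4.7H2O": 0.0027,
    "soybean oil": 1.0,
}

FERMENTOR_CHARGE_L = 30.0


def grams_needed(percent_wv, volume_liters):
    """Grams of a component for a given volume; 1% (w/v) is 10 g per liter."""
    return percent_wv * 10.0 * volume_liters


def inoculum_volume_ml(percent, culture_volume_ml):
    """Volume of seed culture needed to inoculate at the given percentage."""
    return culture_volume_ml * percent / 100.0


batch = {name: grams_needed(pct, FERMENTOR_CHARGE_L)
         for name, pct in MEDIUM_PERCENT.items()}
for name, grams in batch.items():
    print(f"{name}: {grams:.2f} g per {FERMENTOR_CHARGE_L:.0f} liters")

# (label, inoculation rate in percent, receiving culture volume in ml)
seed_train = [
    ("first seed step (100 ml)", 1.5, 100.0),
    ("second seed step (600 ml)", 5.0, 600.0),
    ("fermentor (30 liters)", 5.0, 30000.0),
]
for label, pct, volume_ml in seed_train:
    print(f"{label}: {inoculum_volume_ml(pct, volume_ml):.1f} ml inoculum")
```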

The fermentation broth (28 liters) was filtered, the pH was 
adjusted to 9 with NH4OH, and the broth was extracted with 
CH2Cl2 (twice with 14 liters each). The combined extracts were 
concentrated, and the residue was partitioned between the two 
phases of the heptane/methanol/0.02 M K2HPO4 system (pH 
6; 1:1:1). The aqueous phase was evaporated, and the residue 
was dissolved in 0.05 M aqueous potassium phosphate buffer 
(100 ml), adjusted to pH 9 with NH4OH, and extracted twice 
with equal volumes of CH2Cl2. The CH2Cl2-soluble fraction 
(1.65 g) was separated by droplet countercurrent chromatog- 
raphy (100 vertical columns; 0.4 × 24 cm) by using the upper 
layer from the hexane/ethyl acetate/0.02 M K2HPO4 (pH 8; 
1:1:1) system as the mobile phase. The fractions were analyzed 
by bioassay against Staphylococcus aureus and by ¹H NMR. The 
bioactive fractions that were shown to contain macrolides by 
NMR were pooled, dried (0.4 g), and chromatographed on a 
Sanki Engineering (Kyoto) HPLC centrifugal partition chro- 
matograph by using the hexane/ethyl acetate/0.02 M K2HPO4 
(pH 6; 1:1:1) system. The active fraction was further purified 
by HPLC on C18-derivatized silica by using an acetonitrile/ 
methanol/0.01 M (CH3)4NOH/0.05 M KH2PO4 (pH 6; 
73:10:59:59) system. Two-dimensional NMR experiments, in- 
cluding double-quantum correlation, heteronuclear multiple 
quantum correlation, and heteronuclear multiple-bond corre- 
lation, performed in C²HCl3, were used for structure elucida- 
tion and the complete assignment of the ¹H and ¹³C NMR 
signals. 

In Vitro Antibacterial Activity. Antibacterial activity was 
determined by the broth microdilution method (National 
Committee for Clinical Laboratory Standards (Villanova, PA) 
M7-A4, 1997), except that Brain Heart Infusion (Difco) broth 
was used as the test medium. Assays were incubated overnight 



at 35°C. Minimal inhibitory concentrations were defined as the 
lowest drug concentration (µg/ml) inhibiting bacterial growth. 

RESULTS 

Construction of S. erythraea EAT4. Plasmid pEAT4 was 
constructed to replace DNA encoding the AT domain of 
module 4 in the Er PKS (eryAT4) with DNA encoding the AT 
domain from module 5 of the Nd PKS (nidAT5) (7). S. 
erythraea ER720 protoplasts were transformed with pEAT4 
DNA, and 10 transformants were obtained. Genomic DNA 
was extracted from one of the transformants for Southern 
analysis, in which probing with pWHM3 DNA confirmed the 
integration of pEAT4 at the appropriate location in the 
chromosome (data not shown). Nonselective growth to allow 
plasmid eviction through a double-crossover event yielded 96 
colonies, of which 9 were found to be Ts sensitive. Southern 
analysis of the 9 clones showed that 3 had nidAT5 DNA 
sequences in place of eryAT4 chromosomal sequences, and the 
remaining 6 had segregated to wild type (data not shown). 

Characterization of S. erythraea EAT4. To analyze the Er 
derivatives produced by S. erythraea EAT4, the three isolates 
and four of the wild-type segregants were grown in SCM 
medium for 4 d. The four wild-type segregants produced spots 
indistinguishable in color and Rf from the ErA standard. The 
three strains in which nidAT5 replaced the AT of module 4 of 
the Er PKS produced spots similar in Rf and color to the 
wild-type segregants, but the spots were much less intense 
(data not shown). 

To determine the mass of the compound produced by S. 
erythraea EAT4, the supernatant of a 50-ml SCM culture of 
one of the isolates was extracted with ethyl acetate. The extract 
was subjected to TLC, but only the edges of the plate were 
sprayed with anisaldehyde to locate the region of interest. A 
1-cm band of resin was scraped from the unsprayed portion of 
the plate at the Rf of ErA. The resin was extracted twice with 
500 µl of an ethyl acetate/methanol (2:1) solution, and the 
organic phase was dried and then analyzed by electrospray 
ionization MS. Surprisingly, the mass of the protonated mo- 
lecular ion of the compound was observed at m/z 734, which 
corresponds to the mass of ErA, not an ethyl-substituted ErA 
derivative. 

Feeding of Ethylmalonyl-CoA Precursors to S. erythraea 
EAT4. One hypothesis for the failure of S. erythraea EAT4 to 
produce an ethyl-substituted derivative is that ethylmalonyl- 
CoA, the substrate for the nidAT5, is lacking. To test this 
hypothesis, cells were grown for 4 d in SCM medium contain- 
ing precursor compounds, and ethyl acetate extracts of the 
cultures were analyzed. When either 50 mM butanol or 50 mM 
butyrate was added to the medium, a second spot running 
slightly faster than the Rf of ErA was seen. Butyrate-fed 
cultures (Fig. 4A) produced about a 1:1 ratio of the two spots, 
whereas the butanol-fed culture produced more of the ErA- 
like spot (data not shown). Addition of 10 mM ethylmalonate 
failed to produce significant amounts of either of the two spots 
described above. However, addition of 10 mM diethylethyl- 
malonate was found to yield much more of the faster migrating 
compound than that seen with the other precursor compounds 
tested and very little compound migrating at the Rf of ErA 
(Fig. 4A). Cells grown in unsupplemented SCM medium 
produced only material migrating like ErA. 

To characterize the newly synthesized compound, extracts of 
the butyrate- and the diethylethylmalonate-fed cultures were 
subjected to TLC, the region of interest was scraped from the 
plate and reextracted for electrospray ionization MS. The 
results showed that the butyrate-fed culture of S. erythraea 
EAT4 produced approximately equal amounts of compounds 
with protonated molecular ions at m/z 734 and 748. The 748 
species is consistent with an additional methylene group on 
ErA, e.g., with an ethyl group replacing a methyl group on the 







Fig. 4. TLC analysis of S. erythraea EAT4 and S. erythraea EAT4- 
ccr. (A) Ethyl acetate extracts of S. erythraea EAT4 cultures grown 
under various conditions. Lanes: 1, SCM medium; 2, SCM + 50 mM 
butyrate; 3, SCM + 10 mM diethylethylmalonate; 4, ErA standard (5 
µg). The arrows indicate the position of ErA. (B) Ethyl acetate extracts 
of S. erythraea strains EAT4 (lane 2) and EAT4-ccr (lane 3) grown in 
SCM medium, along with 5.0 µg of ErA standard (lane 1). The arrow 
indicates the position of ErA. 

macrolide ring. The diethylethylmalonate sample contained 
primarily a compound with a mass of 748, with only a trace of 
ErA (734) present. 

Genetic Manipulation of Ethylmalonyl-CoA Levels in S. 
erythraea EAT4. Successful production of an Er derivative with 
a mass consistent with the addition of an ethyl side chain by 
butyrate- and diethylethylmalonate-fed S. erythraea EAT4 sug- 
gested that the levels of ethylmalonyl-CoA, the likely substrate 
of the nidAT5, must be limiting. In streptomycetes, Ccr 
catalyzes the last step in the reductive biosynthesis of butyryl- 
CoA from two molecules of acetyl-CoA (12). Because butyryl- 
CoA can then be carboxylated to form ethylmalonyl-CoA, the 
possibility that Ccr could be used to increase the levels of the 
ethyl-substituted Er derivative was investigated. The ccr gene 
of S. collinus was expressed from the strong ermE* promoter 
at a site unlinked to the Er biosynthetic cluster, in the S. 
erythraea EAT4 chromosome. Plasmid pDPE-ccr was electro- 
porated into S. erythraea EAT4. About 40 Ts-resistant colonies 
were obtained, of which 7 were confirmed to be stable 
transformants. Genomic DNA was isolated from 2 of the 
transformants for Southern analysis. Probing with the S. 
collinus ccr gene confirmed that both isolates carried the gene 
at the expected location in the chromosome (data not shown). 
The strain was named S. erythraea EAT4-ccr. 

To assess expression levels of the ccr gene, Ccr activity of S. 
erythraea EAT4-ccr was compared with that of S. erythraea 
EAT4 (parental strain) and wild-type S. collinus. We found 
(Table 1) that, although there was no detectable activity in the 
S. erythraea strain without the ccr gene, S. erythraea EAT4-ccr 



Table 1. Ccr activity in S. collinus and engineered S. erythraea strains 

Sample                   Enzyme activity,   Protein concentration,   Specific activity, 
                         milliunits/ml      mg/ml                    milliunits/mg 
S. erythraea EAT4        <0.5 ± 0.0         3.0                      <0.2 ± 0.0 
S. erythraea EAT4-ccr    46.9 ± 2.0         3.0                      15.6 ± 0.7 
S. collinus wild type    5.5 ± 1.0          7.0                      0.8 ± 0.1 




had about 20 times the relative activity of the wild-type S. 
collinus strain from which the gene originated. 
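
The specific activities in Table 1 and the roughly 20-fold difference quoted above follow from dividing enzyme activity by protein concentration. A minimal sketch of that arithmetic, using only the mean values from Table 1 (the function name is illustrative, and the reported error terms are not propagated):

```python
def specific_activity(activity_mu_per_ml, protein_mg_per_ml):
    """Specific activity in milliunits per mg of protein."""
    return activity_mu_per_ml / protein_mg_per_ml


# Mean values taken from Table 1.
eat4_ccr = specific_activity(46.9, 3.0)   # S. erythraea EAT4-ccr
collinus = specific_activity(5.5, 7.0)    # S. collinus wild type

fold_over_collinus = eat4_ccr / collinus

print(f"EAT4-ccr: {eat4_ccr:.1f} milliunits/mg")
print(f"S. collinus: {collinus:.1f} milliunits/mg")
print(f"Ratio: about {fold_over_collinus:.0f}-fold")
```

The ratio depends slightly on whether the rounded table entries or the unrounded quotients are used, but both give a value close to 20.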

To determine the effect of the ccr gene on production of an 
ethyl-substituted Er derivative, ethyl acetate extracts of super- 
natants of 4-d SCM-grown cultures of the EAT4 strains with 
and without the ccr gene were examined (Fig. 4B). TLC 
analysis of 10 ml of extracted cells demonstrated that EAT4 
without the ccr gene produced a spot that migrated with ErA, 
whereas EAT4-ccr produced a spot that migrated faster than 
the ErA spot, with no ErA-like material visible. 

Extracts were also analyzed by electrospray ionization MS. 
The EAT4 strain produced a compound with a protonated 
molecular ion at m/z 734, indicating ErA production. The 
EAT4-ccr strain produced a compound with a mass of 748, 
which is consistent with the production of the 6-ethylErA. No 
compound with a mass of 734 (ErA) was detected in extracts 
of this strain. 

Structural Analysis and Biological Activity of Compound 
Produced by S. erythraea EAT4-ccr. The structure of 6-ethyl- 
ErA was confirmed by spectroscopic methods. The high 
resolution fast atom bombardment MS analysis of the sample 
gave an [M + H]+ ion at m/z 748.4846 [calculated for 
C38H70NO13 m/z = 748.4842 (Δ 0.5 ppm)], indicating that this 
molecule has an additional methylene function compared with 
ErA. The presence of an ethyl moiety in the molecule at the 
C-6 position and its relative stereochemistry were confirmed 

Table 2. NMR data of 6-ethylErA 

Carbon no.          ¹³C shift (δ), ppm    ¹H shift (δ), ppm 
1                   175.5 
2                   44.8                  2.93 
3                   80.7                  3.79 
4                   37.6                  1.93 
5                   81.6                  3.86 
6                   76.7 
7                   38.4                  1.72, 1.93 
8                   44.7                  2.70 
9                   222.4 
10                  37.7                  3.01 
11                  68.9                  3.82 
12                  74.4 
13                  76.7                  5.06 
14                  21.2                  1.47, 1.91 
15                  10.7                  0.84 
2-CH3               15.9                  1.29 
4-CH3               9.7                   1.14 
6-CH2-CH3 (CH2)     27.6                  1.70, 2.02 
6-CH2-CH3 (CH3)     7.7                   0.98 
8-CH3               18.6                  1.17 
10-CH3              12.3                  3.01 
12-CH3              68.9                  3.82 
1'                  102.5                 4.48 
2'                  72.5                  3.21 
3'                  65.3                  2.44 
4'                  28.3                  1.22, 1.63 
5'                  68.7                  3.52 
6'                  21.6                  1.22 
3'-N(CH3)2          40.7                  2.29 
1''                 96.2                  4.92 
2''                 35.3                  2.34, 1.66 
3''                 72.4 
4''                 77.7                  3.02 
5''                 66.3                  3.98 
6''                 21.6                  1.29 
3''-OCH3            49.7                  3.33 
3''-CH3             21.4                  1.26 

Numbering of carbon atoms of Er is as described (17). 






by two-dimensional NMR. The stereospecificity of the ethyl 
moiety was found to be the same as that of the methyl group 
present at C-6 of ErA (Fig. 1). The ¹H and ¹³C NMR 
assignments are shown in Table 2. 
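
The calculated m/z quoted for the high-resolution mass measurement can be reproduced from monoisotopic atomic masses. The sketch below is illustrative only: the atomic masses and electron mass are standard values entered by hand, the ion composition C38H70NO13 is the one given in the text, and the ErA ion composition (C37H68NO13) is an assumption based on ErA having one CH2 fewer.

```python
# Monoisotopic atomic masses (u) and electron mass, standard values.
MONOISOTOPIC_MASS = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
ELECTRON_MASS = 0.00054858


def cation_mz(composition):
    """m/z of a singly charged cation with the given elemental composition."""
    total = sum(MONOISOTOPIC_MASS[element] * count
                for element, count in composition.items())
    return total - ELECTRON_MASS


def ppm_error(observed, calculated):
    """Mass error in parts per million."""
    return (observed - calculated) / calculated * 1e6


ETHYL_ERA_ION = {"C": 38, "H": 70, "N": 1, "O": 13}  # [M + H]+ composition from the text
ERA_ION = {"C": 37, "H": 68, "N": 1, "O": 13}        # assumption: one CH2 fewer

calc_ethyl = cation_mz(ETHYL_ERA_ION)
calc_era = cation_mz(ERA_ION)

print(f"6-ethylErA [M + H]+ calculated: {calc_ethyl:.4f}")
print(f"ErA [M + H]+ calculated: {calc_era:.4f}")
print(f"Difference (one CH2): {calc_ethyl - calc_era:.4f}")
print(f"Error vs. observed 748.4846: {ppm_error(748.4846, calc_ethyl):.2f} ppm")
```

The computed error is well under 1 ppm, in line with the reported Δ 0.5 ppm; the exact figure depends on how many decimal places of the calculated mass are carried.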

Antibacterial activity of 6-ethylErA was compared with that 
of ErA against a panel of S. aureus, Staphylococcus epidermidis, 
Streptococcus pyogenes, and Enterococcus faecium strains from 
the Abbott culture collection. Although 6-ethylErA does re- 
tain biological activity, it was found to be 15 to 60 times less 
potent than ErA against these strains. Representative minimal 
inhibitory concentrations (µg/ml) for 6-ethylErA and ErA, 
respectively, were as follows: S. aureus (4 and 0.06), S. epider- 
midis (4 and 0.12), E. faecium (4 and 0.12), and S. pyogenes (1 
and 0.06). 
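
The relative potency quoted above can be recomputed from the representative minimal inhibitory concentrations. This is a rough sketch using the rounded values as printed; the dictionary layout and names are illustrative.

```python
# Representative MICs (µg/ml) from the text: (6-ethylErA, ErA).
MICS = {
    "S. aureus": (4, 0.06),
    "S. epidermidis": (4, 0.12),
    "E. faecium": (4, 0.12),
    "S. pyogenes": (1, 0.06),
}

# Fold loss of potency = MIC of 6-ethylErA divided by MIC of ErA.
fold_less_potent = {organism: ethyl / era for organism, (ethyl, era) in MICS.items()}

for organism, fold in fold_less_potent.items():
    print(f"{organism}: about {fold:.0f}-fold less potent")
```

With the printed values the ratios run from roughly 17-fold to 67-fold. If 0.06 and 0.12 are taken to be rounded two-fold dilution steps (0.0625 and 0.125, an assumption), the ratios become 16-fold to 64-fold, which matches the 15- to 60-fold range stated in the text.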

DISCUSSION 

In this work we have shown that an ethylmalonate AT can be 
substituted for a methylmalonate AT to produce a hybrid PKS 
that synthesizes the predicted macrolide product, 6-ethylErA. 
This suggests that Er analogs containing ethyl substitutions at 
positions other than C-6 or that ethyl analogs of other mac- 
rolides, such as rifamycin or rapamycin, can be prepared in a 
similar fashion by using the ethylmalonate-specific AT from 
the Nd PKS. 

One key feature for the successful generation of 6-ethylErA 
was availability of ethylmalonyl-CoA as substrate for the 
ethylmalonate-specific AT of the Nd PKS in S. erythraea. This 
AT clearly demonstrates a relaxed specificity by reacting with 
both ethylmalonyl-CoA and methylmalonyl-CoA. In S. eryth- 
raea EAT4, the desired product could be obtained only by 
boosting the levels of ethylmalonyl-CoA by either precursor 
feeding or expression of Ccr, an enzyme involved in butyryl- 
CoA biosynthesis. These results suggest that ethylmalonyl- 
CoA derived from butyryl-CoA is not available at sufficient 
levels to compete with methylmalonyl-CoA for the ethylma- 
lonate-specific AT. Ccr activity is not detectable in S. erythraea 
and attempts to detect a ccr gene in this organism by hybrid- 
ization with the S. collinus ccr gene have failed (results not 
shown). In contrast, the ccr gene appears to be present in most 
streptomycetes (16), and in certain cases it appears to be 
clustered with antibiotic biosynthetic gene clusters. For exam- 
ple, a ccr homolog has been located in the biosynthetic cluster 
for tylosin (18), another 16-member macrolide with an ethyl 
side chain. Southern analysis of cosmid clones containing Nd 
biosynthetic genes also indicates that a ccr homolog may reside 
close to the Nd PKS (unpublished results). 

It was demonstrated previously (5) that production of 
desmethyl Er derivatives by replacement of methylmalonate- 
specific ATs of the Er PKS with malonate-specific ATs from 
Streptomyces hygroscopicus and Streptomyces venezuelae was 
successful only in Er modules 1 and 2 (corresponding to C-12 
and C-10 of Er, respectively) and did not give rise to a 
detectable polyketide when placed in module 4. In contrast, we 
successfully produced ethyl-substituted Er by replacement of 
the methylmalonate-specific AT of Er module 4 with the 
ethylmalonate-specific AT from the Nd cluster. In fact, in S. 
erythraea expressing the S. collinus ccr gene, replacements in 
modules 1 through 4 with the ethylmalonate-specific AT 
produced compounds with electrospray ionization mass spec- 
tra consistent with production of ethyl substitutions for methyl 
groups at C-12, C-10, C-8, and C-6, respectively. However, 
those in modules 5 and 6 did not produce compounds of mass 
748 (data not shown). It is not known whether failure to 
produce the predicted compounds was caused by physical 
distortion of the PKS or its mRNA, inability of the PKS to 
process certain altered growing chains, or some structural 



instability of the macrolide itself because of the introduced 
change. Thus, even though genetic information may be present 
for the production of novel compounds, it is still not possible 
to predict which substitutions will yield detectable levels of 
product. 

It has also been demonstrated that the malonyl-specific ATs 
can be distinguished from methylmalonyl-specific ATs through 
sequence alignments (3). By using this strategy, Ruan et al. (5) 
predicted that an AT from an unidentified PKS of S. hygro- 
scopicus was a malonyl-specific AT, and they used it success- 
fully to produce desmethyl Er derivatives. The Nd ethyl- 
specific AT was found to cluster with methylmalonyl-specific 
ATs (7) in similar AT alignments. This similarity may explain 
why methylmalonate was used as a substrate by nidAT5 in S. 
erythraea EAT4 to produce ErA when ethylmalonyl-CoA was 
not available. 

Finally, structural determination by NMR not only con- 
firmed the presence of the ethyl side chain on the Er derivative 
produced by S. erythraea EAT4 but also showed that the 
absolute configuration at this chiral center is the same as that 
in ErA. In Er biosynthesis, epimerization is required at C-6 
because the PKS uses (2S)-methylmalonyl-CoA for chain 
extension (19). It is unclear whether the cognate epimerization 
occurs when ethylmalonate is used in the synthesis of 6-eth- 
ylErA because the stereochemistry of the ethylmalonate in- 
corporated at C-6 is not known. 

We thank Angela Nilius for performing minimal inhibitory con- 
centration assays and Jim McAlpine, Rich Summers, and Thomas 
Vanden Boom for helpful discussions. This work was supported in part 
by Grant GM50542 (to K.A.R.) from the National Institutes of Health. 

1. Cortes, J., Haydock, S. F., Roberts, G. A., Bevitt, D. J. & Leadlay, 
P. F. (1990) Nature (London) 348, 176-178. 

2. Donadio, S. & Katz, L. (1992) Gene 111, 51-60. 

3. Oliynyk, M., Brown, M. J. B., Cortes, J., Staunton, J. & Leadlay, 
P. F. (1996) Chem. Biol. 3, 833-839. 

4. Kuhstoss, S., Huber, M., Turner, J. R., Paschal, J. W. & Rao, 
R. N. (1996) Gene 183, 231-236. 

5. Ruan, X., Pereda, A., Stassi, D., Zeidner, D., Summers, R., 
Jackson, M., Shivakumar, A., Kakavas, S. & Katz, L. (1997) J. 
Bacteriol. 179, 6416-6425. 

6. Marsden, A. F. A., Wilkinson, B., Cortes, J., Dunster, N. J., 
Staunton, J. & Leadlay, P. F. (1998) Science 279, 199-202. 

7. Kakavas, S. J., Katz, L. & Stassi, D. (1997) J. Bacteriol. 179, 
7515-7522. 

8. DeWitt, J. P. (1985) J. Bacteriol. 164, 969-971. 

9. Vara, J. A., Lewandowska-Skarbek, M., Wang, Y. G., Donadio, S. 
& Hutchinson, C. R. (1989) J. Bacteriol. 171, 5872-5881. 

10. Maine, G., Post, D., Satter, M. & Stassi, D. (1997) International 
patent application WO 97/06266 (2/20/97). 

11. Yamamoto, H., Maurer, K. H. & Hutchinson, C. R. (1986) J. 
Antibiot. 39, 1304-1313. 

12. Wallace, K. K., Han, L., Lobo, S. & Reynolds, K. K. (1997) J. 
Bacteriol. 179, 3884-3891. 

13. Bibb, M. J., White, J., Ward, J. M. & Janssen, G. R. (1994) Mol. 
Microbiol. 14, 533-545. 

14. Sambrook, J., Fritsch, E. F. & Maniatis, T. (1989) Molecular 
Cloning: A Laboratory Manual (Cold Spring Harbor Lab. Press, 
Plainview, NY), 2nd Ed. 

15. Fitzgerald, N. B., English, R. S., Lampel, J. S. & Vanden Boom, 
T. (1998) Appl. Environ. Microbiol. 64, 1580-1583. 

16. Wallace, K. K., Bao, Z.-Y., Dai, H., Digate, R., Schuler, G., 
Speedie, M. & Reynolds, K. (1995) Eur. J. Biochem. 233, 954-962. 

17. Lartey, P. A. & Faghih, R. (1993) in Recent Progress in the 
Chemical Synthesis of Antibiotics, eds. Lukacs, G. & Ohno, M. 
(Springer, New York), p. 121. 

18. Gandecha, A. R., Large, S. L. & Cundliffe, E. (1997) Gene 184, 
197-203. 

19. Weissman, K. J., Timoney, M., Bycroft, M., Grice, P., Hanefeld, 
U., Staunton, J. & Leadlay, P. F. (1997) Biochemistry 36, 13849- 
13855. 



Gene. 111 (1992)51-60 

© 1992 Elsevier Science Publishers B.V. All rights reserved. 0378-1119/92/$05.00 
GENE 06278 



Organization of the enzymatic domains in the multifunctional polyketide synthase involved in erythro- 
mycin formation in Saccharopolyspora erythraea 

(FAS; fatty acids; macrolide antibiotic; sequence alignments; Streptomyces) 



Stefano Donadio and Leonard Katz 

Corporate Molecular Biology, Abbott Laboratories, Abbott Park, IL 60064 (U.S.A.) 

Received by C.R. Hutchinson: 21 August 1991 
Revised/Accepted: 15 October/17 October 1991 
Received at publishers: 12 November 1991 



SUMMARY 

Localization of the enzymatic domains in the three multifunctional polypeptides from Saccharopolyspora erythraea involved 
in the formation of the polyketide portion of the macrolide antibiotic erythromycin was determined by computer-assisted 
analysis. Comparison of the six synthase units (SU) from the eryA genes with each other and with mono- and multifunctional 
fatty acid and polyketide synthases established the extent of each β-ketoacyl acyl-carrier protein (ACP) synthase, acyl- 
transferase, β-ketoreductase, ACP, and thioesterase domain. The extent of the enoyl reductase (ER) domain was established 
by detecting similarity to other sequences in the database. A segment containing the putative dehydratase (DH) domain 
in EryAII, with a potential active-site histidine residue, was also found. The finding of conservation of a portion of the 
DH-ER interdomain region in the other five SU, which lack these two functions, suggests a possible evolutionary path for 
the generation of the six SU. 



INTRODUCTION 

Erythromycin, a macrolide antibiotic produced by S. ery- 
thraea, is composed of the polyketide-derived 14-mem- 
bered macrolactone ring, 6dEB, to which are attached two 
deoxysugars, cladinose and desosamine. Synthesis of 6dEB 



Correspondence to: Dr. L. Katz, Abbott Laboratories, D-93D, One Abbott 
Park Rd., Abbott Park, IL 60064 (U.S.A.) 
Tel. (708)937-4132; Fax (708)938^046. 

Abbreviations: aa, amino acid(s); ACP, acyl-carrier protein; ACP-S, ACP 
of SU1; AT, acyltransferase; CoA, coenzyme A; 6dEB, 6-deoxyerythro- 
nolide B; DH, dehydratase; dnaB, gene encoding helicase; ER, enoyl 
reductase; ery, erythromycin biosynthesis gene; eryA, gene encoding 6dEB 
synthase; FAS, fatty acid synthase; FAS1, S. cerevisiae FAS β-chain; 
KR, β-ketoreductase; KS, β-ketoacyl ACP synthase; 6MSAS, 6- 
methylsalicylic acid synthase; ORF, open reading frame; PKS, polyketide 
synthase; S., Saccharopolyspora; SU, synthase unit(s); TE, thioesterase; 
URF, unidentified ORF. 



involves six elongation cycles that resemble the steps in 
fatty acid synthesis. It has recently been shown that 6dEB 
synthesis requires three adjacent eryA genes encoding large 
multifunctional polypeptides and that the eryA cluster con- 
sists of six modules (repeated motifs), each encoding a 
different SU specific for one of the elongation steps (Cortes 
et al., 1990; Donadio et al., 1991). We proposed that the 
genetic organization of eryA and the steps in the biochem- 
ical pathway of 6dEB are colinear (Donadio et al., 1991). 
A scheme of the enzymatic activities leading to 6dEB is 
shown in Fig. 1. 

In previous work, the FAS-like activities ACP, AT, KR 
and KS in the multifunctional eryA-encoded polypeptides 
were identified from the presence of 'signature sequences' 
found at the active site of the functional domains (Cortes 
et al., 1990; Donadio et al., 1991). However, signature se- 
quences have not been assigned previously to the DH and 
ER functions, both of which have been poorly character- 
ized biochemically. Multifunctional FAS systems are or- 




Fig. 1. Scheme of 6-deoxyerythronolide B synthesis (Donadio et al., 1991). The top portion shows the three eryA-encoded polypeptides containing SU 
1 through 6. Enzymatic activities belonging to the first and to the second SU for each polypeptide are represented by empty and shaded circles, respec- 
tively. The bottom portion illustrates the role of each SU in the synthesis of 6dEB, where the C2 units in the ring introduced by odd- and even-numbered 
SU are represented by dashed and continuous lines, respectively. 



ganized in discrete functional domains which can be re- 
solved upon limited proteolysis (Wakil, 1989). The eryA- 
determined SU can be assumed to consist of linear domains 
on the basis of their similar organization to the animal FAS 
systems (Cortes et al., 1990; Donadio et al. f 1991). Here, 
by comparing the 6dEB PKS domains with each other and 
with other multi- and monofunctional FAS and PKS sys- 
tems, we subdivide each eryA -encoded polypeptide into its 
constituent domains and propose a location for the DH 
and ER domains in EryAII. Conclusions similar to those 
reported here have been independently reached in the lab- 
oratory of P.F. Leadtay (personal communication) for the 
domain organization of EryAII and EryAIII and by Wit- 
kowski et al. (1991a) for the domain organization of rat 
FAS. 

RESULTS AND DISCUSSION 

(a) Extent of KS, AT and ACP domains 

The six SUs are organized in pairs in three large deduced 
aa sequences (Fig. 1). Within each polypeptide, the end of 
the N-terminal SU was arbitrarily placed where the se- 
quence C-terminal to the first ACP domain began to di- 
verge from that of the second ACP, located toward the 
C-terminal end of the polypeptide. In this way, SU 1 
through 6 were determined to be 1975, 1516, 1484, 1973, 
1480 and 1690 aa in length, respectively. The domain order 
of each SU is KS, AT, KR and ACP. SU1 has additional 
AT and ACP domains at its N terminus, SU4 contains DH 



and ER domains between AT and KR, and SU6 contains 
a TE at its C-terminal end (Fig. 1). We have used the dif- 
ferent compositions of the SU as a first approximation in 
establishing the extent of some of the domains and con- 
firmed the results obtained by comparison with the mono- 
functional PKS proteins from the gra (Sherman et al., 1989) 
and tcm (Bibb et al., 1989) clusters. In this way, the begin- 
nings and ends of the KS domains could be easily assigned 
to the highly conserved aa motifs d(e)PiAiVgmaCR (upper- 
case letters refer to invariant residues) and GTNAHvIeE, 
respectively (Fig. 2). Comparison of the sequence of the 
first AT of module 1 (AT-S) to the previously aligned six 
other ATs clearly indicated the aa motif vIvFPGQGaQW 
as the likely beginning of the AT domains (Fig. 2). In a 
similar way, their end could be placed where AT-S is seen 
to diverge from the other ATs, a few residues after the 
highly conserved aa motif GVavdwxxa (Fig. 2). Com- 
parison of the eryA ATs with the only monofunctional se- 
quence available to us, a transacylase from Streptomyces 
glaucescens (R. Summers and C.R. Hutchinson, personal 
communication), confirmed the assignment of the N- 
terminal end and showed significant matches, albeit with 
two long gaps, up to the pvxLPt motif, just beyond the 
C-terminal end of the domains established by comparison 
of the eryA ATs. 
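
The domain composition described above can be written down as a small lookup, which makes the differences between the six SU easy to compare. This is only an illustrative sketch: the function name `su_domains` and the Python representation are assumptions introduced here, while the domain orders and the pairing of SU within the three polypeptides are taken from the text and the legend to Fig. 2.

```python
# Illustrative sketch; all names are hypothetical and chosen for this example.
BASE_ORDER = ["KS", "AT", "KR", "ACP"]  # domain order shared by every SU


def su_domains(su: int) -> list[str]:
    """Return the N- to C-terminal domain order of synthase unit `su` (1-6)."""
    if su not in range(1, 7):
        raise ValueError("SU must be between 1 and 6")
    domains = list(BASE_ORDER)
    if su == 4:
        # SU4 carries DH and ER between AT and KR.
        kr = domains.index("KR")
        domains[kr:kr] = ["DH", "ER"]
    if su == 1:
        # SU1 has an extra AT and ACP (AT-S, ACP-S) at its N terminus.
        domains = ["AT", "ACP"] + domains
    if su == 6:
        # SU6 ends with the thioesterase.
        domains.append("TE")
    return domains


# The SU are organized in pairs in the three polypeptides.
POLYPEPTIDES = {"EryAI": (1, 2), "EryAII": (3, 4), "EryAIII": (5, 6)}

for name, units in POLYPEPTIDES.items():
    layout = " | ".join("-".join(su_domains(su)) for su in units)
    print(f"{name}: {layout}")
```

Keeping a single base order and applying the three exceptions on top of it mirrors the way the SU compositions are used in the text as a first approximation for delimiting domains.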

When the segment encompassing the ACP active-site aa 
motif LGxDS from the first ACP of SU1 (ACP-S) was 
aligned with the six other eryA ACPs, the ends of the ACP 
domains appeared to coincide with the aa motif lAxhlxa, 



situated 38 aa after the active-site Ser (Fig. 2). Their starts 
could not be easily established by this criterion, however, 
since conservation between ACP-S and the other six ACPs 
began a few aa before the active-site motif. Comparison 
with monofunctional ACPs from gra and tcm, however, 
indicated that significant matches began with the motif 
lAgisxxe and ended with the motif Glrtpxtlv, 15 aa before 
the end established by comparison of the eryA ACPs alone 
(Fig. 2). This would place the N-terminal end in all of the 
ACP domains, except ACP-S, 45 aa upstream from the 
active-site Ser. 
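
The consensus motifs used to delimit the domains can be searched for mechanically by turning them into regular expressions. The sketch below is an assumption-laden illustration rather than the procedure used in this study: it reads uppercase letters as invariant residues, treats `x` and lowercase letters as unconstrained positions, skips a parenthesized alternative such as `(a)`, and runs on an invented toy sequence. The helper names `motif_to_regex` and `find_motif` are hypothetical.

```python
import re


def motif_to_regex(motif: str) -> str:
    """Convert a consensus motif such as 'LGxDS' into a regular expression.

    Assumed reading of the notation: uppercase = invariant residue (must
    match); 'x' or a lowercase letter = position left unconstrained;
    '(a)' = annotation of an alternative residue, skipped here.
    """
    pattern = []
    skipping = False
    for ch in motif:
        if ch == "(":
            skipping = True
        elif ch == ")":
            skipping = False
        elif skipping or ch.isspace():
            continue
        elif ch.isupper():
            pattern.append(ch)
        else:
            pattern.append(".")
    return "".join(pattern)


def find_motif(motif: str, sequence: str) -> list[int]:
    """Return the 0-based start of every (possibly overlapping) match."""
    regex = re.compile(f"(?={motif_to_regex(motif)})")
    return [m.start() for m in regex.finditer(sequence)]


toy_sequence = "MAELGVDSLTAVELRN"  # invented sequence, for illustration only
print(motif_to_regex("LGxDS"))
print(find_motif("LGxDS", toy_sequence))
```

Leaving lowercase positions unconstrained keeps the search permissive; a stricter variant could require the lowercase residue or a conservative substitute at those positions.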

(b) Conservation of KS, AT and ACP in multifunctional 
systems 

The aligned KS, AT and ACP domains from the six eryA 
SUs were compared with those found in other multifunc- 
tional systems, FAS from chicken (Yuan et al., 1988; 
Holzer et al., 1989) and rat (Amy et al., 1989), and the PKS 
6MSAS from Penicillium patulum (Beck et al., 1990). As 
expected, a higher overall degree of similarity was observed 
in the intra- than in the interdomain regions among the 
multifunctional systems analyzed (Fig. 2). The nine KS 
domains examined shared 71 invariant aa residues (out of 
425 aa) and, considering conservative substitutions, were 
similar for 205/425 residues (Fig. 2). In contrast, AT and 
ACP domains exhibit lower conservation (Fig. 2; see also 
below). It is tempting to speculate that the basis for the 
apparently high sequence constraint in the KSs from dis- 
tantly related organisms is that, in addition to catalyzing 
the condensation of the acyl chain with the extender unit 
charged on the ACP to form β-ketoacyl-ACP (Wakil, 1989), 
the KS is also responsible for the transacylation of the 
elongated acyl chain from the ACP to its own active-site 
Cys residue. 

Invariant aa accounted for 26/345 residues in the ten AT 
domains examined, and approx. 30% of the ten sequences 
involved conservative substitutions (Fig. 2). It is notewor- 
thy that, in addition to the segment around the signature aa 
sequence GHSxG, two additional segments, each contain- 
ing an invariant His, are highly conserved in the ten ATs. 
Serine proteases are known to contain active-site Ser and 
His residues, distant in the primary structure, but brought 
into close proximity in the folded protein (Hess, 1971). 
Since the types of reaction carried out by ATs and serine 
proteases are believed to be similar (McCarthy and Hardie, 
1984), the finding of invariant His in the ATs distant from 
the active-site Ser suggests a similarity with serine pro- 
teases also in catalytic mechanism. 
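
To compare the conservation figures quoted for the KS and AT domains on one scale, the counts can be turned into percentages. This is a minimal sketch, taking the counts as 71 and 205 of 425 positions for the KS domains and 26 of 345 for the AT domains; the labels and the helper name `percent` are invented for the example.

```python
def percent(part: int, whole: int) -> float:
    """Express `part` out of `whole` as a percentage."""
    return 100.0 * part / whole


# (count, alignment length) pairs as read from the text; labels are ours.
counts = {
    "KS, invariant": (71, 425),
    "KS, similar (counting conservative substitutions)": (205, 425),
    "AT, invariant": (26, 345),
}

for label, (n, total) in counts.items():
    print(f"{label}: {n}/{total} = {percent(n, total):.1f}%")
```

On this scale the share of invariant positions in the KS domains is roughly twice that in the AT domains, which is the contrast the text draws.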

The ten ACP domains examined exhibit only one invari- 
ant residue outside of the LGxDS motif. The 30-aa N- 
terminal segment of the ACP domains from the SU shows 
some conservation with mono- and multifunctional pro- 
teins, except for ACP-S (data not shown). In their inde- 




pendent study of domain organization of rat FAS, Wit- 
kowski et al. (1991a) have placed the N-terminal end of the 
ACP domain approx. 10 aa C-terminal to the end sug- 
gested here. Thus, the ACP-S domain lacks in its 
N-terminal portion a segment of at least 15 aa when com- 
pared to the other ACPs. This apparent anomaly may re- 
flect a functional difference between ACP-S and the other 
ACPs. According to the model proposed for 6dEB syn- 
thesis (Donadio et al., 1991), the sole role of ACP-S con- 
sists of receiving the propionyl starter unit from AT-S and 
of transferring it to the KS of SU 1. Its function would thus 
be limited to acylation/deacylation, and this ACP would 
not be employed in carrying the β-ketoacyl chain through 
the appropriate processing steps, as do all other FAS and 
PKS ACPs known. 

(c) Extent of ER, DH, KR and TE domains 

Only eryA module 4 encodes DH and ER functions, 
which, to date, have only been tentatively located in FAS 
systems (Tsukamoto and Wakil, 1988). A 400-aa segment 
unique to SU4 and believed to include the ER domain 
(Donadio et al., 1991), was used to search the databases. 
Surprisingly, the best matching sequences found, aside from 
the rat and chicken FASs, were from structural proteins of 
higher eukaryotes, ζ-crystallin from guinea pig lens (Ro- 
dokanaki et al., 1989) and the membrane protein VAT-1 
from Torpedo californica synaptic vesicles (Linial et al., 
1989). The similarity of these two proteins to alcohol de- 
hydrogenases has already been reported. In addition, the 5' 
end of an URF divergently transcribed from the dnaB gene 
of Salmonella typhimurium (Wong et al., 1988) was detected 
by this search. Alignment of these sequences indicated that 
the ER domain is likely to extend for approx. 330 aa and 
contains 19 invariant and 90 conserved aa residues (Fig. 3). 
In particular, the sequence LxHxg(a)xGGVG, proposed as 
the NADPH-binding site for the rat ER (Amy et al., 1989; 
Witkowski et al., 1991a), appears to be highly conserved in 
the six sequences examined. It should be noted that data- 
base searches also indicated similarity between the ER 
domain and alcohol dehydrogenases, but this similarity is 
limited mainly to the putative NADPH-binding site (data 
not shown). Although no enzymatic role has been assigned 
to ζ-crystallin or VAT-1, the high similarity detected among 
the six sequences suggests a possible present or former role 
for the two monofunctional proteins in reducing double 
bonds that lie α,β to a carbonyl group (Piatigorsky and 
Wistow, 1991). 

The ca. 500-aa segment defined by the end of the AT and 
the beginning of the ER domain in SU4 showed some 
similarity to the corresponding segments from rat and 
chicken FAS (Donadio et al., 1991), as well as with an 
approx. 200-aa stretch located between the AT and KR 
domains of 6msas (data not shown), which is believed to 



[Fig. 2: multiple sequence alignment of the six eryA SU; the aligned sequences are not legible in this copy.] 

Fig. 2. Alignments of the six eryA SU. The symbols on the left margin refer to the particular SU, with S referring to AT-S or ACP-S, as appropriate. 
Numbers on the right margin refer to the aa sequence position at the end of each row in EryAI (for 1, 2 and S on left), in EryAII (for 3 and 4) and in 
EryAIII (for 5 and 6). Sequences for EryAI and for EryAII and -III are from GenBank, accession Nos. M63676 and M63677, respectively. Invariant 
aa residues in the six SU are marked by dashes. Dots refer to computer-introduced gaps to maximize alignments. Shaded boxes refer to aa residues in- 
variant in the six (or seven) sequences from the SU, as well as chicken FAS (Holzer et al., 1989; Yuan et al., 1988), rat FAS (Amy et al., 1989) and 6MSAS 
(Beck et al., 1990). Open boxes refer to conservative substitutions or invariant residues in all but one sequence. The N terminus of chicken FAS is as- 
sumed to precede the published sequence (Holzer et al., 1989), as recently reported (Witkowski et al., 1991a). The KR of SU3, when it deviates from 
the other eight sequences, is ignored for boxing purposes. The extent of each domain is indicated by underlining of the sequences with solid black bars, 
short, heavy dashes, long, heavy dashes, and open bars, representing the KS, AT, KR and ACP domains, respectively. The two arrows mark the extra 
segments of 152 and 315 aa present in SU4, which are presented in Figs. 4 and 3, respectively. The shaded bars under the sequences in the region comprised 
between the two arrows indicate invariant and conservative substitutions among the six SU. Computer-assisted sequence analyses were performed using 
the University of Wisconsin GCG programs (Devereux et al., 1984). Sequences were examined pairwise using COMPARE/DOTPLOT. Multiple sequence 
alignments were performed using PILEUP, with a gap weight of 3.0 and a gap length weight of 0.1. The sequences of the six SU were initially aligned. 
Subsequently, the segments corresponding to the first AT and ACP of SU1 (AT-S and ACP-S, respectively) were individually aligned with the other six 
AT and ACP domains, respectively. Finally, the region of SU4 between the DH and ER domains (section c) and those from SU 1, 2, 3, 5 and 6 be- 
tween the AT and KR domains were separately aligned. The three alignments generated in this way were manually combined using LINEUP. For comparing 
the six eryA SU with the other multifunctional systems examined, chicken FAS, rat FAS and 6MSAS, PILEUP was run using one sequence from each 
group (one SU, one FAS and 6msas) or with all nine sequences, with similar results. The DH-ER interdomain region of SU4 and the AT-KR interdomain 
regions from the other five SU, when compared with the other multifunctional systems, gave substantially different alignments upon changing of the PILEUP 
parameters. These segments have thus been ignored for boxing purposes. 
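
The display convention described in this legend, with dashes for invariant residues and dots for gaps, can be reproduced from any set of aligned rows. The following is a rough sketch under stated assumptions: the three rows are invented toy data, not taken from the alignment, and the function names `invariant_columns` and `dash_notation` are hypothetical. It does not perform the alignment itself, which in this work was done with PILEUP.

```python
def invariant_columns(rows: list[str], gap: str = ".") -> list[int]:
    """Return the 0-based columns in which every row has the same residue."""
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("aligned rows must all have the same length")
    return [
        i
        for i, column in enumerate(zip(*rows))
        if len(set(column)) == 1 and column[0] != gap
    ]


def dash_notation(rows: list[str], gap: str = ".") -> list[str]:
    """Keep the first row as is; in later rows show invariant columns as '-'."""
    invariant = set(invariant_columns(rows, gap))
    rendered = [rows[0]]
    for row in rows[1:]:
        rendered.append(
            "".join("-" if i in invariant else ch for i, ch in enumerate(row))
        )
    return rendered


# Invented toy alignment; '.' marks a gap, as in the figure legends.
toy_rows = ["ALGVDSL", "SLGMDSL", "ALGIDS."]
print(invariant_columns(toy_rows))
for line in dash_notation(toy_rows):
    print(line)
```

Counting the entries returned by `invariant_columns` gives the kind of invariant-residue tally quoted in the text for each domain type.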



[Fig. 3: alignment of the putative enoyl reductase segments from chicken FAS, rat FAS, SU4, ζ-crystallin, VAT-1 and the Sty URF; the aligned sequences are not legible in this copy.] 



Fig. 3. Putative enoyl reductase domains. The segment from EryAII comprised between aa 2832 and 3138 was employed to screen the GenBank and 
EMBL (48285 sequences) and the Swissprot (20722 sequences) databases using the programs TFASTA and FASTA, respectively. Sequences showing 
significant matches were aligned using PILEUP. Invariant aa residues are represented by dashes; dots refer to inserted gaps. Conserved regions among 
all the sequences are boxed. An asterisk indicates the stop codon in the corresponding gene. Sequences as indicated: chicken, FAS from chicken, aa 
1483-1853; rat, FAS from rat, aa 1496-1866; SU4, S. erythraea EryAII, aa 2795-3138; ζ-cryst, guinea pig lens crystallin, complete sequence (Rodokanaki 
et al., 1989); VAT-1, membrane protein from Torpedo californica cholinergic synaptic vesicles, complete sequence (Linial et al., 1989); Sty URF, Salmo- 
nella typhimurium unidentified open reading frame divergent from dnaB, translated from GenBank J03390 (Wong et al., 1988). Note that only the 5' end 
of this sequence is available, with the resulting polypeptide ending as KALGAKL 168. 



contain a DH function (Beck et al., 1990). The alignment 
of these four sequences (Fig. 4) indicates that significant 
homology is limited to an approx. 150-aa segment. Within 
it, the invariant HxxxGxxxxP motif is embedded in a 25- 
aa segment with a high degree of conservative substitu- 
tions, involving mostly hydrophobic residues. The finding 
of an invariant His residue in the most conserved region 
among the four sequences is consistent with the proposed 
role for a His as the active-site residue in the E. coli β- 
hydroxydecanoyl thioester dehydrase (Bloch, 1971). The 
corresponding gene encodes the active-site His in the se- 
quence HFIGDPVMP 78 (Cronan et al., 1988), where in- 
sertion of a gap between I and G would conform this se- 
quence to the proposed consensus. The HxxxGxxxxP motif 
is also found at a single position in the S. cerevisiae FAS1 
sequence as HLSHGVKMIP 1057, approximately where the 
DH domain has been tentatively placed (Schweizer et al., 
1986; Chirala et al., 1987). These observations suggest that 
the DH domain in multifunctional FAS and PKS systems 
is relatively short, extending for approx. 140-170 aa, con- 
sistent with the 170-aa size of the E. coli enzyme, and point 
to a specific His as one of the active-site residues involved 
in catalysis. The same His has been independently pro- 
posed by P.F. Leadlay (personal communication) as the 
catalytic residue in the DH domain of EryAII. It should 
also be noted that the two animal FASs and SU4 also 
share the motif GYxYGPxFQ, approx. 110 aa after the 
proposed active-site His, whereas 6msas does not. The 



[Fig. 4: alignment of the putative dehydratase segments from chicken FAS, rat FAS, SU4 and 6msas; the aligned sequences are not legible in this copy.] 



Fig. 4. Putative dehydratase domains. The approx. 500-aa segments from EryAII, chicken FAS, rat FAS and 6msas suspected to contain the DH do- 
main (see section c) were aligned using PILEUP. Only the portion showing significant matches is represented. For abbreviations and symbols, refer to 
Fig. 3. The three blackened squares denote the putative active-site motif HxxxGxxxxP. The shaded bar denotes the highly conserved region common only 
to the two FAS sequences and SU4. Sequences: chicken, FAS from chicken, aa 812-987; rat, FAS from rat, aa 837-1009; SU4, S. erythraea EryAII, 
aa 2365-2551; 6msas, Penicillium patulum 6MSAS, aa 914-1096. 






reason for this difference and a possible role for this motif 
are at present unknown. No significant matches were de- 
tected by database searching with the proposed DH do- 
main or by comparison with known aa dehydratases. 
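
The remarks on the E. coli and S. cerevisiae sequences can be checked mechanically against the proposed HxxxGxxxxP consensus. This is a small sketch, not part of the original analysis: the regular expression is our transcription of the motif, `-` stands for the inserted gap, and the names `DH_MOTIF` and `fits_consensus` are hypothetical.

```python
import re

# H, three arbitrary positions, G, four arbitrary positions, P.
DH_MOTIF = re.compile(r"H.{3}G.{4}P")


def fits_consensus(segment: str) -> bool:
    """True if the whole segment matches HxxxGxxxxP ('-' counts as a position)."""
    return DH_MOTIF.fullmatch(segment) is not None


yeast_fas1 = "HLSHGVKMIP"    # segment quoted for S. cerevisiae FAS1
ecoli = "HFIGDPVMP"          # segment quoted for the E. coli dehydrase
ecoli_gapped = "HFI-GDPVMP"  # same segment with a gap between I and G

for name, segment in [
    ("S. cerevisiae FAS1", yeast_fas1),
    ("E. coli", ecoli),
    ("E. coli, gapped", ecoli_gapped),
]:
    print(f"{name}: {segment} -> {fits_consensus(segment)}")
```

The ungapped E. coli segment is one residue short between the His and the Gly, which is why a single gap is enough to bring it into register with the consensus.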

All six eryA SU contain a segment corresponding to a 
KR domain, although the KR domain of SU3 is believed 
to be non-functional (Donadio et al., 1991). The beginning 
of the KR domains is likely to coincide with the region 
following the ER domain where SU4 realigns with the other 
five SU. This location was matched in the N-terminal por- 
tion of the monofunctional KRs involved in actinorhodin 
(Hallam et al., 1988) and granaticin (Sherman et al., 1989) 
synthesis. Thus, the ery KR domains are likely to start with 
the PxGTvLv motif, just upstream from the putative 
NADPH-binding site (Fig. 2). Since in all multifunctional 
FAS and PKS systems the KR is always followed by an 
ACP, the end of the KR domains was placed approx. 
190 aa after the NADPH-binding site, where conservation 
among the nine sequences examined began to decline 
(Fig. 2). This interpretation results in the separation of the 
KR and ACP domains of 90-100 aa in the PKS systems 
and of 60 aa in the FAS systems. 

The C-terminal end of SU6 contains a TE domain. This 
domain was compared with the corresponding domains 
from the two FAS sequences, with monofunctional 
thioesterases from rat (Randhawa and Smith, 1987; Saf- 
ford et al., 1987) and duck (Poulose et al., 1985), and with 
the TE-like ORF downstream from eryF (Weber et al., 
1991). The alignments (Fig. 5) indicate that the TE domain 
in eryA extends for approx. 230 aa, and includes, in addi- 
tion to the invariant GxSxG motif common to ATs and 
serine proteases, the GdH motif found near the C-terminal 
end, which has been shown by site-directed mutagenesis to 
be essential for activity (Witkowski et al., 1991b). Overall, 
little similarity was detected among the six TEs analyzed, 
which may be related to the three different classes of sub- 
strates recognized by these enzymes (Wakil, 1989; Cortes 
et al., 1990; Donadio et al., 1991). This is exemplified by 
the low similarity between the TE domain of rat FAS and 
the short chain TE from the same organism, and between 
the TE of SU6 and the other TE-like sequence from the ery 
cluster, although the role of the latter has not yet been 
determined. 

(d) Inter- and extradomain regions 

The overall domain organization of the three eryA- 
encoded polypeptides is summarized in Fig. 6. It can be 
seen that the largest interdomain regions are the five seg- 
ments between AT and KR, and the one between DH and 
ER in EryAII. When the AT-KR interdomain regions from 
SU 1, 2, 3, 5 and 6 were compared with the region from 
SU4 containing the DH and ER domains, some similarity 
could be detected under relatively stringent conditions (data 
not shown). Computer-generated alignment of these six 
segments indicated that these regions can be best accom- 
modated after accounting for two insertions in SU4, the 
first of 152 aa, and the second of 315 aa (Fig. 2). These two 
insertions correspond very closely to the DH and ER do- 
mains, respectively, as determined above. In the 200-aa 
segment which joins the DH and ER domains in SU4, a 



[Fig. 5: alignment of the thioesterase sequences from rat FAS, chicken FAS, rat SC, duck SC, the ery ORF and SU6; the aligned sequences are not legible in this copy.] 


Fig. 5. Thioesterase domains. The six TEs were compared using PILEUP. See legend to Fig. 3 for symbols. Sequences: rat, FAS from rat, aa [start illegible]- 
2505; chicken, FAS from chicken, aa 2193-2497; rat SC, short-chain TE from rat, complete sequence (Randhawa and Smith, 1987; Safford et al., 1987); 
duck SC, short-chain TE from duck, complete sequence (Poulose et al., 1985); ery ORF, downstream from S. erythraea eryF (Weber et al., 1991; GenBank 
accession No. M54983); SU6, S. erythraea EryAIII, aa 2927-3170. 






[Fig. 6 graphic: domain maps of EryAI, EryAII and EryAIII drawn against a scale in aa (ticks at 1000, 2000 and 3000); the domain key is not recoverable from the scan.]



Fig. 6. Domain organization in 6dEB synthase. Each domain is repre- 
sented by a rectangle of different filling as shown, whose length is pro- 
portional to the length of the domain. Note the partial filling of the first 
KR of EryAII, which denotes an inactive KR (Donadio et al., 1991).



stretch of approx. 80 aa appears to be fairly well conserved 
among the six SU (Fig. 2). The enzyme from S. erythraea 
AKR5, which has been deleted of this 80-aa segment along 
with the KR domain C-terminal to it in SU5, retains ac- 
tive SU5 and SU6, as judged by the ability to produce a 
significant amount of the 6dEB analog lacking the hydroxyl
group introduced during synthesis step 5 (Donadio et al.,
1991; Fig. 1). Whilst Witkowski et al. (1991a) have speculated
that the long DH-ER interdomain segment is involved in
facilitating protein-protein interactions in the
dimeric FAS enzyme, our results indicate that the presence 
of at least a portion of this segment is not absolutely re- 
quired for 6dEB PKS function, although the level of ac- 
tivity of the altered enzyme could not be measured directly. 
That the deleted segment is important for KR activity is 
improbable, since very little homology was detected with 
other multifunctional systems in this region (data not 
shown). 

An additional feature of the eryA-encoded polypeptides
is the presence of extra N-terminal and C-terminal tails 
extending significantly beyond the domain limits (Fig. 6). 
The N termini of polypeptides EryAII and EryAIII contain 
segments of 26 and 33 aa preceding the KS domains of 
SU3 and SU5, respectively, and the C termini of EryAI 
and EryAII contain stretches of 69 and 63 aa following the 
ACP domains of SU2 and SU4, respectively. The other 
multifunctional systems examined do not contain extra tails 
of such lengths. In 6msas a 28-aa segment precedes the KS 
domain, but the ACP domain is followed by a segment of 
only 6 aa at the C terminus of the polypeptide. The KS 
domain of rat FAS starts at the N terminus of the protein. 
It is tempting to speculate that these additional segments 
in the eryA-encoded polypeptides may play a role in facilitating
the correct intermolecular transfer of the growing
acyl chain, such as from SU2 in EryAI to SU3 in EryAII,
either by enabling specific protein-protein interactions, or



by properly positioning the polypeptides on some cellular 
structure. 

(e) Evolution of the modules 

S. erythraea contains a Type-I PKS and, most likely, a
Type-II FAS system (Revill and Leadlay, 1991). The ev- 
olutionary origin of these two systems can be understood 
by comparison of similar enzymatic functions belonging to 
a Type-I or Type-II system from different sources, as exemplified
for the 21 ACPs presented in the dendrogram in
Fig. 7. The eryA ACPs are closely related to each other, 
except for ACP-S, which, as described above, does not 
function as all other known ACPs and is less related to the 
other SU ACPs than the ACP from 6msas. Nonetheless, 
the Type-I PKS ACPs appear to be clustered together in- 
dicating greater overall similarity amongst each other than 
with ACPs from other systems. Similarly, the Type-II PKS 
ACPs form their own cluster as do both the Type-I and the 
Type-II FAS ACPs (Fig. 7). The determination that the
SU ACPs more closely resemble Type-I FAS systems than 
the monofunctional FAS ACP from the same host suggests 
that Type-I PKSs and Type-I FASs had a common an- 
cestor. This hypothesis is corroborated by the observation 
of a similar pattern when the six eryA KSs were compared 



[Fig. 7 dendrogram: branch structure not recoverable from the scan. Leaves, top to bottom: SU1, SU4, SU3, SU5, SU2, SU6, 6msas, SU1-S, rat, chicken, Tcm, Gra, WhiE, Barley I, Barley III, Rape, R. meliloti, E. coli, Synechocystis, S. erythraea, NodF. Cluster labels: PKS, Type-I; FAS, Type-I; PKS, Type-II; FAS, Type-II; Nodulation, Type-II.]

Fig. 7. Relatedness of ACPs and ACP domains. The ACP domains from
multifunctional FAS and PKS systems (see Fig. 2) and the monofunctional
ACPs are shown as the dendrogram obtained from PILEUP. ACP
sequences: WhiE, spore-associated pigment genes from Streptomyces
coelicolor (Davis and Chater, 1990); Barley I and III, forms I and III
(Hansen, 1987); Rape, Brassica napus plastid seed (Safford et al., 1988);
R. meliloti, Rhizobium meliloti constitutive ACP (Platt et al., 1990); E. coli,
E. coli FAS ACP (Holak et al., 1988); Synechocystis, Synechocystis 6803
(Froehlich et al., 1990); S. erythraea, S. erythraea putative FAS ACP (Revill
and Leadlay, 1991); NodF, R. meliloti nodulation-specific ACP (Debelle
and Sharma, 1986).






with eight other Type-I and Type-II sequences (data not 
shown). 
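The clustering argument above rests on grouping sequences by pairwise similarity. The following is a minimal sketch of that idea only, assuming pre-aligned sequences, a simple fraction-of-mismatches distance and average-linkage (UPGMA) clustering; it is not PILEUP and does not reproduce its scoring. The function names and the four toy sequences are hypothetical and are not taken from the paper.

```python
from itertools import combinations

def p_distance(a, b):
    """Fraction of aligned positions (gaps skipped) at which two sequences differ."""
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    return sum(x != y for x, y in pairs) / len(pairs)

def upgma(seqs):
    """Cluster pre-aligned sequences by average linkage; return a nested-tuple tree."""
    dist = {frozenset(p): p_distance(seqs[p[0]], seqs[p[1]])
            for p in combinations(seqs, 2)}
    # Each cluster is (subtree, member names).
    clusters = [(name, [name]) for name in seqs]

    def linkage(m1, m2):
        # Mean of all pairwise distances between members of the two clusters.
        total = sum(dist[frozenset((x, y))] for x in m1 for y in m2)
        return total / (len(m1) * len(m2))

    while len(clusters) > 1:
        # Merge the two clusters with the smallest average distance.
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda ij: linkage(clusters[ij[0]][1], clusters[ij[1]][1]))
        (t1, m1), (t2, m2) = clusters[i], clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(((t1, t2), m1 + m2))
    return clusters[0][0]

# Hypothetical 8-residue toy sequences, not taken from the paper.
TOY = {
    "A1": "MKTAYIAK",
    "A2": "MKTAYIAR",
    "B1": "MRSLWVGK",
    "B2": "MRSLWVGR",
}
print(upgma(TOY))
```

With these toy inputs the two "A" sequences pair with each other before either joins a "B" sequence, which is the kind of pattern the text reads from Fig. 7 when it says that ACPs of one system type cluster together.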

The finding of a stretch of the DH-ER interdomain re- 
gion in the SU lacking these two functions is also consis- 
tent with the hypothesis that the eryA modules are likely to 
have evolved from an ancestral element (FAS- or PKS- 
like) which encoded the full set of activities involved in the 
processing of the β-carbonyl (DH, ER and KR), followed
by loss of the functions not required at particular steps of 
6dEB synthesis. Two modes of specialization through loss 
of function seem to have occurred in the eryA modules: 
selected mutations in the KR-encoding domain in module 
3, and loss of the DH- and ER-encoding segments in all of 
the modules except module 4. Loss of function (ER) 
through extensive deletion may have also taken place in 
6msas. It will be interesting to analyze the sequences of 
other PKS systems lacking KR, DH or ER domains to
better understand the mode of evolution of pathways for
complex polyketides.

(f) Conclusions 

Our results on the extent of the various domains in the 
six eryA SU, determined solely by computer-assisted align- 
ments, can be extended to other related systems and are 
substantially in agreement with those independently found 
by P.F. Leadlay and colleagues (personal communication) 
and by Witkowski et al. (1991a), who corroborated their
computer analysis with limited proteolysis studies. The ex- 
istence of multiple sequences with identical function in eryA 
has greatly facilitated assignments of the various domains. 
We have proposed a location for the DH domain and a 
putative active-site His for it. Type-I FAS and PKS systems
also seem to share a common origin independent of
their prokaryotic or eukaryotic source. 

ACKNOWLEDGEMENTS 

We thank Dick Hutchinson, Peter Leadlay, Stuart Smith 
and Rich Summers for providing us with results prior to
publication. 

REFERENCES 

Amy, C.M., Witkowski, A., Naggert, J., Williams, B., Randhawa, Z. and
Smith, S.: Molecular cloning and sequencing of cDNAs encoding the
entire rat fatty acid synthase. Proc. Natl. Acad. Sci. USA 86 (1989)
3114-3118.

Beck, J., Ripka, S., Siegner, A., Schiltz, E. and Schweizer, E.: The
multifunctional 6-methylsalicylic acid synthase gene of Penicillium patulum.
Its gene structure relative to that of other polyketide synthases. Eur.
J. Biochem. 192 (1990) 487-498.

Bibb, M.J., Biro, S., Motamedi, H., Collins, J.F. and Hutchinson, C.R.:
Analysis of the nucleotide sequence of the Streptomyces glaucescens
tcmI genes provides key information about the enzymology of
polyketide antibiotic biosynthesis. EMBO J. 8 (1989) 2727-2736.

Bloch, K.: β-Hydroxydecanoyl thioester dehydrase. In: Boyer, P.D. (Ed.),
The Enzymes, 3rd ed., Vol. V. Academic Press, New York, 1971,
pp. 441-464.

Chirala, S.S., Kuziora, M.A., Spector, D.M. and Wakil, S.J.:
Complementation of mutations and nucleotide sequence of FAS1 gene
encoding the β subunit of yeast fatty acid synthase. J. Biol. Chem. 262
(1987) 4321-4340.

Cronan Jr., J.E., Li, W.-B., Coleman, R., Narasimhan, M., de Mendoza,
D. and Schwab, J.M.: Derived amino acid sequence and identification
of the active site residues of Escherichia coli β-hydroxydecanoyl
thioester dehydrase. J. Biol. Chem. 263 (1988) 4641-4646.

Cortes, J., Haydock, S.H., Roberts, G.A., Bevitt, D.J. and Leadlay, P.F.:
An unusually large multifunctional polypeptide in the
erythromycin-producing polyketide synthase of Saccharopolyspora erythraea. Nature
348 (1990) 176-178.

Davis, N.K. and Chater, K.F.: Spore colour in Streptomyces coelicolor
A3(2) involves the developmentally regulated synthesis of a compound
biosynthetically related to polyketide antibiotics. Mol. Microbiol.
4 (1990) 1679-1691.

Debelle, S. and Sharma, S.B.: Nucleotide sequence of Rhizobium meliloti
RCR2011 genes involved in host specificity of nodulation. Nucleic
Acids Res. 14 (1986) 7453-7472.

Devereux, J., Haeberli, P. and Smithies, O.: A comprehensive set of
sequence analysis programs for the VAX. Nucleic Acids Res. 12 (1984)
387-395.

Donadio, S., Staver, M.J., McAlpine, J.B., Swanson, S.J. and Katz, L.:
Modular organization of genes required for complex polyketide
biosynthesis. Science 252 (1991) 675-679.

Froehlich, J.E., Poorman, R., Reardon, E., Barnum, S.R. and Jaworski,
J.G.: Purification and characterization of acyl carrier protein from two
cyanobacteria species. Eur. J. Biochem. 193 (1990) 817-825.

Hallam, S.E., Malpartida, F. and Hopwood, D.A.: Nucleotide sequence,
transcription and deduced function of a gene involved in polyketide
antibiotic synthesis in Streptomyces coelicolor. Gene 74 (1988)
305-320.

Hansen, L.: Three cDNA clones for barley leaf acyl carrier protein I and
III. Carlsberg Res. Commun. 52 (1987) 381-392.

Hess, G.P.: Chymotrypsin - chemical properties and catalysis. In: Boyer,
P.D. (Ed.), The Enzymes, 3rd ed., Vol. III. Academic Press, New
York, 1971, pp. 213-248.

Holak, T.A., Nilges, M., Prestegard, J.H., Gronenborn, A.M. and Clore,
G.M.: Three-dimensional structure of acyl carrier protein in solution
determined by nuclear magnetic resonance and the combined use of
dynamical simulated annealing and distance geometry. Eur. J. Biochem.
175 (1988) 9-15.

Holzer, K., Liu, W. and Hammes, G.H.: Molecular cloning and sequencing
of chicken liver fatty acid synthase cDNA. Proc. Natl. Acad. Sci.
USA 86 (1989) 4387-4391.

Linial, M., Miller, K. and Scheller, R.H.: VAT-1. An abundant membrane
protein from Torpedo cholinergic synaptic vesicles. Neuron 2 (1989)
1265-1273.

McCarthy, A.D. and Hardie, D.G.: Fatty acid synthase - an example of
protein evolution by gene fusion. Trends Biochem. Sci. 9 (1984)
60-63.

Piatigorsky, J. and Wistow, G.: The recruitment of crystallins: new
functions precede gene duplication. Science 252 (1991) 1078-1079.

Platt, M.W., Miller, K.J., Lane, W.S. and Kennedy, E.P.: Isolation and
characterization of the constitutive acyl carrier protein from Rhizobium
meliloti. J. Bacteriol. 172 (1990) 5440-5444.

Poulose, A.J., Rogers, L., Cheesbrough, T.M. and Kolattukudy, P.E.:
Cloning and sequencing of the cDNA for S-acyl fatty acid synthase
thioesterase from the uropygial gland of mallard duck. J. Biol. Chem.
260 (1985) 15953-15958.






Randhawa, Z.I. and Smith, S.: Complete amino acid sequence of the
medium-chain S-acyl fatty acid synthetase thio ester hydrolase from
rat mammary gland. Biochemistry 26 (1987) 1365-1373.

Revill, W.P. and Leadlay, P.F.: Cloning, characterization and high level
expression in Escherichia coli of the Saccharopolyspora erythraea gene
encoding an acyl carrier protein potentially involved in fatty acid
biosynthesis. J. Bacteriol. 173 (1991) 4379-4385.

Rodokanaki, A., Holmes, R.K. and Borras, T.: Zeta crystallin, a novel
protein from guinea pig lens is related to alcohol dehydrogenase. Gene
78 (1989) 215-224.

Safford, R., de Silva, J., Lucas, C., Windust, J.H.C., Shedden, J., James,
C.M., Sidebottom, C.M., Slabas, A.R., Tombs, M.P. and Hughes,
S.G.: Molecular cloning and sequence analysis of complementary
DNA encoding rat mammary gland medium-chain S-acyl fatty acid
synthetase thio ester hydrolase. Biochemistry 26 (1987) 1358-1364.

Safford, R., Windust, J.H.C., Lucas, C., de Silva, J., James, C.M.,
Hellyer, A., Smith, C.G., Slabas, A.R. and Hughes, S.G.: Plastid-localised
seed acyl-carrier protein of Brassica napus is encoded by a distinct,
nuclear multigene family. Eur. J. Biochem. 174 (1988) 287-295.

Schweizer, M., Roberts, L.M., Höltke, H.-J., Takabayashi, K., Höllerer,
E., Hoffmann, B., Müller, G., Köttig, H. and Schweizer, E.: The
pentafunctional FAS1 gene of yeast: its nucleotide sequence and order
of catalytic domains. Mol. Gen. Genet. 203 (1986) 479-486.

Sherman, D.H., Malpartida, F., Bibb, M.J., Kieser, H.H., Bibb, M.J. and
Hopwood, D.A.: Structure and deduced function of the
granaticin-producing polyketide synthase gene cluster of Streptomyces
violaceoruber Tü22. EMBO J. 8 (1989) 2717-2725.

Tsukamoto, Y. and Wakil, S.J.: Isolation and mapping of the
β-hydroxyacyl dehydratase activity of chicken liver fatty acid synthase.
J. Biol. Chem. 263 (1988) 16225-16229.

Wakil, S.J.: Fatty acid synthase, a proficient multifunctional enzyme.
Biochemistry 28 (1989) 4523-4530.

Weber, J.M., Leung, J.O., Swanson, S.J., Idler, K.B. and McAlpine, J.B.:
An erythromycin derivative produced by targeted gene disruption in
Saccharopolyspora erythraea. Science 252 (1991) 114-117.

Witkowski, A., Rangan, V.S., Randhawa, Z.I., Amy, C.M. and Smith, S.:
Structural organization of the multifunctional animal fatty-acid
synthase. Eur. J. Biochem. 198 (1991a) 571-579.

Witkowski, A., Naggert, J., Wessa, B. and Smith, S.: A catalytic role for
histidine-237 in rat mammary gland thioesterase II. J. Biol. Chem. 266
(1991b) 18514-18519.

Wong, A., Kean, L. and Maurer, R.: Sequence of the dnaB gene of
Salmonella typhimurium. J. Bacteriol. 170 (1988) 2668-2675.

Yuan, Z., Liu, W. and Hammes, G.H.: Molecular cloning and sequencing
of DNA complementary to chicken liver fatty acid synthase
mRNA. Proc. Natl. Acad. Sci. USA 85 (1988) 6328-6331.



Journal of Bacteriology, Sept. 1999, p. 5600-5605 Vol. 181, No. 18

0021-9193/99/$04.00+0

Copyright © 1999, American Society for Microbiology. All Rights Reserved.

Insertional Inactivation of Methylmalonyl Coenzyme A (CoA)

Mutase and Isobutyryl-CoA Mutase Genes in Streptomyces 
cinnamonensis: Influence on Polyketide Antibiotic Biosynthesis 

JAN W. VRIJBLOED, KATJA ZERBE-BURKHARDT, ANANDA RATNATILLEKE,
ANDREAS GRUBELNIK-LEISER, and JOHN A. ROBINSON* 
Department of Chemistry, University of Zurich, CH-8057 Zurich, Switzerland 
Received 26 April 1999/Accepted 15 July 1999 

The coenzyme B12-dependent isobutyryl coenzyme A (CoA) mutase (ICM) and methylmalonyl-CoA mutase
(MCM) catalyze the isomerization of n-butyryl-CoA to isobutyryl-CoA and of methylmalonyl-CoA to
succinyl-CoA, respectively. The influence that both mutases have on the conversion of n- and isobutyryl-CoA to
methylmalonyl-CoA and the use of the latter in polyketide biosynthesis have been investigated with the
polyether antibiotic (monensin) producer Streptomyces cinnamonensis. Mutants prepared by inserting a
hygromycin resistance gene (hygB) into either icmA or mutB, encoding the large subunits of ICM and MCM,
respectively, have been characterized. The icmA::hygB mutant was unable to grow on valine or isobutyrate as
the sole carbon source but grew normally on butyrate, indicating a key role for ICM in valine and isobutyrate
metabolism in minimal medium. The mutB::hygB mutant was unable to grow on propionate and grew only
weakly on butyrate and isobutyrate as sole carbon sources. 13C-labeling experiments show that in both mutants
butyrate and acetoacetate may be incorporated into the propionate units in monensin A without cleavage to
acetate units. Hence, n-butyryl-CoA may be converted into methylmalonyl-CoA through a carbon skeleton
rearrangement for which neither ICM nor MCM alone is essential.



Streptomycetes produce a wide variety of commercially important
polyketide secondary metabolites, including the well-known
macrolide and polyether antibiotics. The assembly of
these antibiotics, catalyzed by the large family of polyketide 
synthases, requires malonyl coenzyme A (CoA), methylmalo- 
nyl-CoA, and ethylmalonyl-CoA as extender units, which are 
the precursors of the acetate-, propionate-, and butyrate-de- 
rived units, respectively, in the polyketide backbone (8, 11, 12). 

Fatty acid and amino acid catabolisms are known to play an
important role in supplying building blocks for polyketide biosynthesis.
n-Butyryl-CoA and isobutyryl-CoA are known intermediates
in fatty acid and valine catabolism, and both may be
converted intact into methylmalonyl-CoA and ethylmalonyl-CoA
in streptomycetes (Fig. 1). This is evidenced by the
efficient incorporation of 13C-labeled n- and isobutyrate into
both the propionate- and butyrate-derived units in several
polyketide antibiotics, including monensin A (20, 22), tylosin
(18), and narasin (6). The coenzyme B12-dependent isobutyryl-CoA
mutase (ICM) appears to play a key role in these processes,
by catalyzing the interconversion of n- and isobutyryl-CoA.
However, the preferred route(s) for the conversion of
n- and isobutyryl-CoA into methylmalonyl-CoA is less clearly
defined. One possibility, consistent with results of 13C-labeling
studies, is the direct oxidation of isobutyryl-CoA, in several
enzymatic steps, perhaps via methacrylyl-CoA and 3-hydroxybutyryl-CoA,
to give methylmalonyl-CoA (path A in Fig. 1),
although this route is different from the well-known pathway of
valine catabolism in pseudomonads and mammals that leads
via methylmalonic acid semialdehyde to propionyl-CoA (10,
15, 32) (Fig. 1). Another possibility is that n-butyryl-CoA might
be oxidized at C-4 (ω oxidation) over several steps without



* Corresponding author. Mailing address: Institute of Organic
Chemistry, University of Zurich, Winterthurerstrasse 190, CH-8057
Zurich, Switzerland. Phone: (0041)-1-635-4242. Fax: (0041)-1-635-6812.
E-mail: robinson@oci.unizh.ch.



fragmentation, to give succinyl-CoA, which may then be converted
to methylmalonyl-CoA by methylmalonyl-CoA mutase
(MCM) (path B in Fig. 1). Known routes to methylmalonyl-CoA
presently include the carboxylation of propionyl-CoA,
catalyzed by propionyl-CoA carboxylase, and the isomerization
of succinyl-CoA, catalyzed by the coenzyme B12-dependent
MCM. Ethylmalonyl-CoA appears to be derived solely by
carboxylation of n-butyryl-CoA.

To further explore the roles of MCM and ICM in the pro- 
duction of methylmalonyl-CoA, we report here the properties 
of Streptomyces cinnamonensis mutants in which either MCM 
or ICM has been inactivated by gene disruption. The genes 
encoding both enzymes have been cloned and sequenced from 
S. cinnamonensis, the producer of the polyether antibiotic mo- 
nensin A (Fig. 1) (2, 33). MCM consists of two subunits, MutA 
and MutB, of 65 and 79 kDa, respectively, which show high 
sequence similarities to MCMs from other bacteria and mam- 
mals. So far, ICM has been isolated only from S. cinnamonen- 
sis, where it was found to comprise a large subunit of 65 kDa 
(IcmA) and a small subunit of ca. 17 kDa (IcmB) (33). The 
cloning and insertional inactivation of icmA from S. cinna- 
monensis were reported earlier (33). In this work, a mutant was 
prepared by inserting a hygromycin resistance gene (hygB) into 
mutB. The growth characteristics of both icmA::hygB and 
mutB::hygB mutants and the pattern of incorporation of 13C-labeled
precursors into the polyketide monensin A in each are
reported below. The results indicate a key role for ICM in 
valine and isobutyrate metabolism under normal growth con- 
ditions. 

MATERIALS AND METHODS 

Bacterial strains, plasmids, and fermentations. The bacterial strains and plasmids
used in this study are listed in Table 1. S. cinnamonensis A3823.5, a
high-yield producer of monensin A, was a gift of Eli Lilly & Co. (Indianapolis,
Ind.). Phenotypic analysis of S. cinnamonensis strains was performed on solid
minimal medium (9) with ammonium sulfate as nitrogen source and various
carbon sources (50 mM). The growth was monitored after 7 to 10 days at 30°C.






Vol. 181, 1999 



MUTASES AND METHYLMALONYL-CoA 5601 



[Fig. 1 scheme: structures not recoverable from the scan. Labeled species: valine, isobutyryl-CoA, n-butyryl-CoA, methylmalonyl-CoA, succinyl-CoA, propionyl-CoA, acetyl-CoA, malonyl-CoA, ethylmalonyl-CoA and monensin A. Labeled steps and boxes: ICM + B12; MCM + B12; path A; path B; methylmalonic acid semialdehyde dehydrogenase (valine catabolism in pseudomonads and mammals); odd- and even-chain fatty acids; glyoxylate and citric acid cycles; polyketide biosynthesis.]

FIG. 1. Pathways of valine catabolism and the reactions catalyzed by MCM and ICM.

Chemicals. Chemicals were purchased from Fluka, Buchs, Switzerland. Sodium
[1-13C]butyrate and ethyl [1,3-13C2]acetoacetate were from Cambridge
Isotope Laboratories, Inc. Sodium [1-13C]isobutyrate was prepared as described
earlier (22).

Insertional inactivation of mutB. In order to disrupt mutB in S. cinnamonensis,
a 2.55-kb BamHI fragment (2) containing almost the entire mutB gene was
cloned into the BamHI site of pUC18 to generate pOCI403. Subsequently, a
hygromycin resistance gene (hygB) was cloned as a BamHI-BglII fragment into
the unique BglII site of pOCI403 to generate pOCI352. The BamHI fragment of
pOCI352 was isolated and cloned into the unique BamHI cloning site of
pGM160. The resulting plasmid, pOCI353, after passage through Escherichia coli
ET12567 and Streptomyces lividans TK64, was used to transform S. cinnamonensis
A3823.5. Selection for the disrupted mutB gene was performed as previously
described (33).

Incorporation of 13C-labeled precursors into monensin A. Singly labeled precursors
(100 mg per culture) were fed batchwise in equal portions after 24, 36,
and 48 h to two 60-ml cultures in a complex liquid medium (22, 28). Subsequently,
monensin A was purified from the culture broth as in earlier work (22)
and examined by inverse-gated 13C{1H} nuclear magnetic resonance (NMR)
spectroscopy (20-s relaxation delay, 30° pulse) on a Bruker AMX600 spectrometer.
Enrichment levels were determined by signal integration. The ethyl
[1,3-13C2]acetoacetate (100 mg) was mixed with unlabeled ethyl acetoacetate (200
mg) and then batch fed to two liquid cultures (150 mg per culture) in three equal
portions after 48, 60, and 72 h of growth. Typically, 30 to 70 mg of pure monensin
A was obtained from each experiment.
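The acetoacetate feeding amounts above can be checked with a few lines of arithmetic. This is only a worked sketch: the variable names are illustrative, and the labeled fraction is computed by mass, which is an assumption that ignores the small mass difference between labeled and unlabeled ester.

```python
# Quantities stated in the feeding protocol above.
labeled_mg = 100.0    # ethyl [1,3-13C2]acetoacetate
unlabeled_mg = 200.0  # unlabeled ethyl acetoacetate
cultures = 2          # liquid cultures fed
portions = 3          # equal portions, at 48, 60 and 72 h

total_mg = labeled_mg + unlabeled_mg
per_culture_mg = total_mg / cultures         # should match the stated 150 mg per culture
per_portion_mg = per_culture_mg / portions   # amount added at each time point
labeled_fraction = labeled_mg / total_mg     # by mass (assumption, see lead-in)

print(f"{per_culture_mg:.0f} mg per culture, {per_portion_mg:.0f} mg per portion")
print(f"labeled fraction (by mass): {labeled_fraction:.2f}")
```

The diluted precursor therefore carries roughly one labeled molecule in three, which is relevant when reading doublet intensities against natural-abundance singlets.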



RESULTS 

Insertional inactivation of S. cinnamonensis mutB. Disruption
of mutB was achieved by cloning a hygromycin resistance
gene, hygB, into the unique BglII site of mutB (2). A restriction
fragment containing the disrupted mutB and hygB genes was
then cloned into pGM160, which also carries the thiostrepton
resistance gene. The resulting plasmid, pOCI353, was used to
transform protoplasts of S. cinnamonensis A3823.5. Isolation
of the desired mutant was performed by selecting for Ts^s and
Hg^r colonies at 39°C, as in earlier work (33). A Southern blot
hybridization analysis with total DNA isolated from one clone
(mutB::hygB) confirmed that the mutB gene had been disrupted,
consistent with a double-crossover event (data not
shown). The mutB::hygB mutant was devoid of MCM activity in
cell extracts.

Phenotype analysis of the S. cinnamonensis mutant strains.
The growth characteristics of wild-type (wt) S. cinnamonensis
and the mutB::hygB and icmA::hygB mutants on solid minimal
medium containing a single carbon source were determined,
and the results are summarized in Table 2. When grown in a



5602 



VRIJBLOED ET AL. 



J. Bacteriol.



TABLE 1. Bacterial strains and plasmids used in this study

Strain or plasmid           Description                                   Reference or source

Streptomyces spp.
  S. cinnamonensis A3823.5  Monensin-overproducing strain                 Eli Lilly & Co.
  S. lividans TK64          pro-2 str-6                                   9

E. coli ET12567             hsdS dam-3 dcm-6                              14

Plasmids
  pUC18                                                                   17
  pGM160                    Temperature-sensitive plasmid                 16
  pIJ963                    Contains the hygB gene                        13
  pOCI403                   Contains 2.3-kb BamHI fragment of mutB        This work
                            in pUC18
  pOCI352                   Hygromycin resistance gene cloned as          This work
                            BamHI-BglII fragment in the BglII site
                            of pOCI403
  pOCI353                   Contains the disrupted mutB gene cloned       This work
                            as BamHI fragment in pGM160



complex oil-based medium (medium 5, described in reference
28), the wt and the two mutants grew equally well and produced
comparable levels of monensin A. However, during
growth in this medium addition of isobutyrate or valine at a ca.
5 mM concentration to the icmA::hygB mutant caused cessation
of further growth and monensin production. In a chemically
defined medium (4) containing mainly glucose, tyrosine,
and valine as C and N sources, the growth of the mutB::hygB
mutant was comparable to that of the wt, whereas the growth
of the icmA::hygB mutant was weak.

Incorporation of 13C-labeled precursors into monensin A.
The ability of n-butyrate and isobutyrate to serve as precursors
for monensin A biosynthesis in cultures of the mutant strains
mutB::hygB and icmA::hygB was tested by adding n-[1-13C]butyrate
and [1-13C]isobutyrate to shake cultures of each, as well
as to wt S. cinnamonensis. Monensin A was subsequently isolated
and examined by 13C{1H} NMR spectroscopy. The observed
enrichments are summarized in Table 3. Only trace
amounts of monensin A could be isolated from the icmA::hygB
strain when it was fed with [1-13C]isobutyrate. The addition of
isobutyrate (or valine) to this mutant strongly inhibited further
growth and monensin production. The same effect was not
observed upon feeding the mutant with n-[1-13C]butyrate. The
mutB::hygB strain showed relatively high incorporations of 13C



TABLE 2. Growth of S. cinnamonensis strains on solid minimal
medium with various added carbon sources(a)

Carbon source            wt       mutB::hygB   icmA::hygB

Glucose                  + + +    + + +        + + +
Succinate                + + +    + + +        + + +
Valine                   + +      + +          -
Acetate                  + +      + +          + +
Propionate               + + +    -            + + +
Butyrate                 + + +    +            + + +
Valerate                 + + +    -            + + +
Caproate                 + +      +            + +
Isobutyrate              + +      +            -
Isocaproic acid          + +      +            -
Crotonic acid            + +      +            + +
Ethyl acetoacetate       + +      +            + +
rac-3-Hydroxybutyrate    + + +    + + -        + + +
None                     -        -            -

(a) + + +, strong, rapid, dense mycelial growth; + +, intermediate growth; +,
slow, weak, but still clear growth; -, almost no mycelial growth, corresponding
to growth without a carbon source.
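The growth phenotypes in Table 2 lend themselves to a small lookup. The sketch below is illustrative only: the scores are a transcription of Table 2 (3 for "+ + +", 2 for "+ +", 1 for "+", 0 for "-"), cells that are blank in the scan are entered as 0 on the assumption that they were "-", and rac-3-hydroxybutyrate is left out because one of its cells is not legible. The names `GROWTH`, `STRAINS` and `no_growth` are hypothetical.

```python
# Strain order used for every tuple in GROWTH.
STRAINS = ("wt", "mutB::hygB", "icmA::hygB")

# Scores transcribed from Table 2 (see lead-in for the coding and assumptions).
GROWTH = {
    "glucose": (3, 3, 3),
    "succinate": (3, 3, 3),
    "valine": (2, 2, 0),
    "acetate": (2, 2, 2),
    "propionate": (3, 0, 3),
    "butyrate": (3, 1, 3),
    "valerate": (3, 0, 3),
    "caproate": (2, 1, 2),
    "isobutyrate": (2, 1, 0),
    "isocaproic acid": (2, 1, 0),
    "crotonic acid": (2, 1, 2),
    "ethyl acetoacetate": (2, 1, 2),
}

def no_growth(strain):
    """Carbon sources that support the wt but not the given strain."""
    i = STRAINS.index(strain)
    return sorted(src for src, s in GROWTH.items() if s[0] > 0 and s[i] == 0)

for strain in STRAINS[1:]:
    print(strain, "->", ", ".join(no_growth(strain)))
```

Under these assumptions the query returns propionate and valerate for the mutB::hygB mutant, and valine, isobutyrate and isocaproic acid for the icmA::hygB mutant.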



in the propionate and butyrate units of monensin A upon being 
fed with both labeled butyrate and isobutyrate (Table 3). 

Ethyl [1,3-13C2]acetoacetate was fed to the wt and both S.
cinnamonensis mutants, the monensin A was subsequently isolated
from each culture, and 13C{1H} NMR spectra were recorded.
For monensin A from both mutants, highly enriched
doublet signals were observed surrounding the natural abundance
singlets for all positions derived from C-1 and C-2 of a
propionate building block, i.e., in monensin A, C-1-C-2, C-3-C-4,
C-5-C-6, C-11-C-12, C-17-C-18, C-21-C-22, and C-23-C-24
(see Fig. 2 for examples). This implies the formation of
[1,2-13C2]methylmalonyl-CoA from [1,3-13C2]acetoacetyl-CoA
in vivo. Also, the signals for C-15 and C-32, corresponding to
the C-1 and C-3 positions in the butyrate building block, were
strongly enriched (Fig. 2).

DISCUSSION 

ICM is an MCM-like enzyme from S. cinnamonensis com- 
prising a large subunit (IcmA) of 566 residues (33) and a small 



TABLE 3. Enrichments observed in monensin A after
incorporation of n-[1-13C]butyrate and [1-13C]isobutyrate in the wt
and in mutB::hygB and icmA::hygB mutants of S. cinnamonensis

                    n-[1-13C]butyrate                [1-13C]isobutyrate
Monensin carbon     wt     mutB::hyg  icmA::hyg      wt     mutB::hyg  icmA::hyg

Acetate unit
  C-7               1.4    1.5        1.3            1.7    1.6        ND
  C-9               1.5    1.6        1.6            2.1    1.8        ND
  C-13              1.5    1.3        1.4            2.8    1.7        ND
  C-19              1.8    1.8        1.5            3.4    2.2        ND
  C-25              1.6    1.6        1.3            2.1    1.8        ND

Propionate unit
  C-1               5.0    8.8        3.1            6.9    9.2        ND
  C-3               5.1    8.8        3.0            7.7    9.3        ND
  C-5               5.2    9.2        3.1            7.2    9.3        ND
  C-11              4.6    8.8        3.1            8.2    8.6        ND
  C-17              4.7    8.7        3.1            6.8    9.3        ND
  C-21              5.3    9.5        3.2            7.1    9.4        ND
  C-23              4.9    8.9        3.1            7.1    9.0        ND

Butyrate unit
  C-15              16.8   12.3       8.9            17.0   12.4       ND

ND, not determined (see text).
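To compare strains at a glance, the per-carbon enrichments in Table 3 can be averaged within each building-block type. The sketch below transcribes the legible values from Table 3; the column labels, the dictionary layout and the function name `unit_means` are illustrative choices, and the icmA::hygB isobutyrate column is omitted because it was not determined.

```python
from statistics import mean

# Column order for every tuple below (precursor fed / strain).
COLUMNS = (
    "butyrate/wt", "butyrate/mutB::hygB", "butyrate/icmA::hygB",
    "isobutyrate/wt", "isobutyrate/mutB::hygB",
)

# Enrichments transcribed from Table 3.
TABLE3 = {
    "acetate": {
        "C-7": (1.4, 1.5, 1.3, 1.7, 1.6),
        "C-9": (1.5, 1.6, 1.6, 2.1, 1.8),
        "C-13": (1.5, 1.3, 1.4, 2.8, 1.7),
        "C-19": (1.8, 1.8, 1.5, 3.4, 2.2),
        "C-25": (1.6, 1.6, 1.3, 2.1, 1.8),
    },
    "propionate": {
        "C-1": (5.0, 8.8, 3.1, 6.9, 9.2),
        "C-3": (5.1, 8.8, 3.0, 7.7, 9.3),
        "C-5": (5.2, 9.2, 3.1, 7.2, 9.3),
        "C-11": (4.6, 8.8, 3.1, 8.2, 8.6),
        "C-17": (4.7, 8.7, 3.1, 6.8, 9.3),
        "C-21": (5.3, 9.5, 3.2, 7.1, 9.4),
        "C-23": (4.9, 8.9, 3.1, 7.1, 9.0),
    },
    "butyrate": {
        "C-15": (16.8, 12.3, 8.9, 17.0, 12.4),
    },
}

def unit_means(unit):
    """Mean enrichment per column over all carbons of one building-block type."""
    rows = list(TABLE3[unit].values())
    return {col: round(mean(r[i] for r in rows), 2) for i, col in enumerate(COLUMNS)}

for unit in TABLE3:
    print(unit, unit_means(unit))
```

Averaged this way, the propionate positions of the mutB::hygB strain come out higher than those of the wt for both precursors, in line with the relatively high incorporation noted in the text.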







[Fig. 2 spectrum: trace not recoverable from the scan. Labeled signals: C19, C18, C6, C10, C2, C32, C11, C22, C15, C31 and C14 of monensin A; chemical shift axis from 44.0 to 28.0 ppm.]

FIG. 2. A portion of the 13C{1H} NMR spectrum of monensin A isolated after feeding [1,3-13C2]ethyl acetoacetate to a mutB::hygB mutant of S. cinnamonensis.
The strongly enriched signals from the n-butyrate unit in the backbone of monensin (at positions 15 and 32) are shown, along with the enriched doublets (for C-2, C-4,
C-24, C-23, C-6, C-18, C-11, and C-22) surrounding the natural abundance signals for positions derived from propionate building blocks.



subunit (IcmB) of 136 residues (21). The MCM from this 
organism is a heterodimer with a large subunit (MutB) of 733 
residues and a small subunit (MutA) of 616 residues (2). In this 
work, MCM was inactivated in S. cinnamonensis by inserting a 
hygB gene into mutB, to produce a mutB::hygB mutant. The
cloning and insertional inactivation of icmA from S. cinna- 
monensis were reported earlier (33). 

The effect of the icmA and mutB mutations on the growth of 
S. cinnamonensis was studied on solid minimal medium with 
various carbon sources. Both mutants show comparable growth 
on control plates containing glucose and succinate (Table 2). 
As with a previously reported knockout of MCM in Nocardia 
corallina (30), the S. cinnamonensis mutB::hygB mutant was
unable to grow on propionate and valerate, since the conversion
of methylmalonyl-CoA to succinyl-CoA was blocked (Fig.
1). The growth of this mutant was also strongly retarded on n-
and isobutyrate, n- and isocaproate, and acetoacetate and
crotonic acid as sole carbon sources.

The icmA::hygB mutant was unable to grow on valine, isobutyrate,
or isocaproate, although it grew normally on acetate,
butyrate, and valerate (Table 2). Moreover, addition of isobutyrate
during growth in rich medium in liquid culture caused
cessation of further growth and monensin production (see labeling
studies in Materials and Methods and Results). The
ability of both mutB::hygB and icmA::hygB mutants to grow
normally on acetate as sole carbon source is inconsistent with
the operation of a recently proposed (7) novel anaplerotic
pathway that is an alternative to the glyoxylate cycle, for utilization
of C2 and C4 fatty acids in streptomycetes.

Valine catabolism in pseudomonads and mammals (10, 15, 
32) proceeds via isobutyryl-CoA, 3-hydroxybutyric acid, and 
methylmalonic acid semialdehyde, to propionyl-CoA (indicat- 
ed in the dotted box in Fig. 1). The gene encoding methylma- 
lonic acid semialdehyde dehydrogenase was cloned recently 
from Streptomyces coelicolor (34). An msdAv.hygA mutant lost 
methylmalonic acid semialdehyde dehydrogenase activity as 
well as the ability to grow in a minimal medium with valine or 
isobutyrate as the sole carbon source. Nevertheless, labeling 
studies of many (but not all [19]) polyketide antibiotic-producing
streptomycetes (3, 18, 20, 22, 24) have shown that
[2-13C]valine and [1-13C]isobutyrate label C-1 of each propionate-derived
subunit in the polyketide backbone (as indicated
for monensin A in Fig. 1), rather than being lost as carbon
dioxide. The results of these labeling experiments are inconsistent
with a major flux through the Pseudomonas-mammalian
pathway of valine catabolism during polyketide biosynthesis in
the stationary phase in these streptomycetes, including S. cinnamonensis.
Moreover, since the S. cinnamonensis icmA::hygB
mutant was unable to grow on valine or isobutyrate, this
Pseudomonas-mammalian pathway of valine catabolism is either
absent or poorly expressed in this mutant during growth
on minimal medium.

The conversion of isobutyryl-CoA to methylmalonyl-CoA by
path A (Fig. 1) has been frequently discussed in studies of 
antibiotic-producing streptomycetes (29), including S. cinna- 
monensis (22). If path A is operative, however, this should 
provide a route for the utilization of valine and isobutyrate in 
the icmA::hygB mutant, and yet the mutant could not grow in
minimal medium with isobutyrate or valine as sole carbon 
source (Table 2). It seems possible that, during antibiotic pro- 
duction in the stationary phase in a rich complex medium (28), 
a different pattern of gene expression and metabolite utiliza- 
tion may occur, compared to the situation on minimal agar. 
The inability of the icmA::hygB mutant to grow on isobutyrate
and valine in minimal medium, therefore, may not rule out a 
role for path A (Fig. 1) in complex medium during the station- 
ary phase in S. cinnamonensis. 

An alternative route to methylmalonyl-CoA via succinyl-CoA
might conceivably involve the conversion of isobutyryl-CoA
to n-butyryl-CoA, with ω-oxidation of the latter to succinyl-CoA
(path B in Fig. 1). The reversible conversion of
crotonyl-CoA to 4-hydroxybutyryl-CoA catalyzed by an oxygen-sensitive,
flavin-containing enzyme is known for Clostridia
(23, 26), but it is so far unclear whether a similar enzyme
occurs in streptomycetes, and no other route for the ω-functionalization
of n-butyryl-CoA is presently known. In addition,
it was shown earlier (27) that trideuteriolabeled acetate can be 
efficiently incorporated into all the propionate units in monen- 
sin A without loss of deuterium. This observation rules out a 
major flux from n-butyryl-CoA to methylmalonyl-CoA via suc- 
cinyl-CoA but is consistent with path A (Fig. 1). It should be 
noted, however, that similar experiments carried out with 
Streptomyces longisporoflavus, the producer of the polyether 
ICI139603 (also called tetronasin), showed that the C-methyl 
groups retain at most two deuterium atoms from acetate (5), as 
would be expected for an incorporation of the label via succi- 
nate and MCM (Fig. 1). 

To determine how n- and isobutyrate are utilized for
polyketide biosynthesis in the icmA and mutB mutant strains,
13C-labeled precursors were fed to each, and the pattern of
incorporation into monensin A was determined by 13C NMR.
An efficient incorporation of n-[1-13C]butyrate into the propionate-derived
units in the backbone of monensin A was observed
in the wt and was higher in the mutB mutant but slightly
lower in the icmA mutant (Table 3). The enrichments in acetate-derived
units in monensin A were low, indicating a minimal
breakdown of [13C]butyrate to [13C]acetate prior to incorporation.
These results are consistent with a major flux through
path A (Fig. 1) but show also that label is still incorporated at
a lower efficiency, by an alternative route, when ICM is inactivated.

The efficient incorporation of [1-13C]isobutyrate into the
propionate units in monensin A in the mutB::hygB strain shows
that the conversion of isobutyryl-CoA to methylmalonyl-CoA
does not require MCM. Unfortunately, it was not possible to
incorporate [1-13C]isobutyrate into monensin in the icmA mutant,
because addition of isobutyrate (or valine) to the culture
arrested further growth and monensin production.

Finally, a labeling experiment with ethyl [1,3-13C2]acetoacetate
was carried out to establish whether the incorporation of
a C4 precursor into the propionate units is possible without
prior cleavage to acetate units. Most likely, the precursor is
hydrolyzed by endogenous lipases or esterases, is activated in
the cytoplasm to [1,3-13C2]acetoacetyl-CoA, and then is converted
to n-butyryl-CoA by steps similar to those involved in
fatty acid metabolism (31) or polyhydroxybutyrate metabolism
(1). The monensin A isolated from both mutants and the wt
showed an essentially identical pattern of one-bond 13C-13C
couplings (Fig. 2), within each propionate unit, consistent with
the formation and incorporation of [1,2-13C2]methylmalonyl-CoA
without fragmentation of the precursor to acetate units.
Hence, [1,3-13C2]acetoacetyl-CoA can be converted efficiently
into [1,2-13C2]methylmalonyl-CoA in the absence of either
ICM or MCM. Presently, it is uncertain how butyryl-CoA is
converted intact into methylmalonyl-CoA when ICM is absent,
although we disfavor path B in Fig. 1 for reasons discussed
above.

At the present time, we are unable to identify one unifying 
metabolic plan that fully explains all the data discussed above. 
A scenario, however, in which a still unrecognized route allows 
the conversion of n-butyryl-CoA (or acetoacetyl-CoA) into 
methylmalonyl-CoA in streptomycetes may be considered. In 
this respect, it is interesting to note that a third MCM-like 
protein of unknown function called MEA has been described 
recently (7). MEA has been implicated in the assimilation of 
C2 compounds in Streptomyces collinus and in methanol and
ethanol utilization in Methylobacterium extorquens (25). Further
clarification of C2 and C4 metabolism in streptomycetes
may benefit from the elucidation of the function of this MEA 
protein. 

ACKNOWLEDGMENTS 

This work was supported financially by the Canton of Zurich. 
J.W.V. and K.Z.-B. contributed equally to this work. 
Grateful acknowledgment is made to Annelies Meier for expert 
technical assistance. 

REFERENCES 

1. Anderson, A. J., and E. A. Dawes. 1990. Occurrence, metabolism, metabolic 
role, and industrial uses of bacterial polyhydroxyalkanoates. Microbiol. Rev. 
54:450-472. 

2. Birch, A., A. Leiser, and J. A. Robinson. 1993. Cloning, sequencing, and 
expression of the gene encoding methylmalonyl-coenzyme A mutase from 
Streptomyces cinnamonensis . J. Bacteriol. 175:3511-3519. 

3. Clark, C. A., and J. A. Robinson. 1985. Biosynthesis of nonactin. The role of 
acetoacetyl-CoA in the formation of nonactic acid. J. Chem. Soc. Chem. 
Commun., p. 1568-1569. 

4. Day, L. E., J. W. Chamberlin, E. Z. Gordee, S. Chen, M. Gorman, R. L.
Hamill, T. Ness, R. E. Weeks, and R. Stroshane. 1973. Biosynthesis of
monensin. Antimicrob. Agents Chemother. 4:410-414.

5. Doddrell, D. M., E. D. Laue, F. J. Leeper, J. Staunton, A. Davies, A. B.
Davies, and G. A. F. Ritchie. 1984. Biosynthesis of the polyether antibiotic
ICI139603 in Streptomyces longisporoflavus: investigation of deuterium retention
after incorporation of CD3 13COOH, 13CD3COOH and
CH3CD2 13COOH using 2H NMR and edited 13C NMR spectra. J. Chem.
Soc. Chem. Commun., p. 1302-1304.

6. Dorman, D. E., J. W. Paschal, W. M. Nakatsukasa, L. L. Huckstep, and N.
Neuss. 1976. The use of 13C-NMR spectroscopy in biosynthetic studies. II.
Biosynthesis of narasin, a new polyether ionophore from fermentation of
Streptomyces aureofaciens. Helv. Chim. Acta 59:2625-2634.

7. Han, L., and K. A. Reynolds. 1997. A novel alternate anaplerotic pathway to
the glyoxylate cycle in streptomycetes. J. Bacteriol. 179:5157-5164.

8. Hopwood, D. A. 1997. Genetic contributions to understanding polyketide
synthases. Chem. Rev. 97:2465-2497.

9. Hopwood, D. A., M. J. Bibb, K. F. Chater, T. Kieser, C. J. Bruton, H. M.
Kieser, D. J. Lydiate, C. P. Smith, J. M. Ward, and H. Schrempf. 1985.
Genetic manipulation of Streptomyces: a laboratory manual. John Innes 
Foundation, Norwich, United Kingdom. 

10. Kamoun, P. 1992. Valine is a precursor of propionyl-CoA. Trends Biochem. 
Sci. 17:175. 

11. Katz, L. 1997. Manipulation of modular polyketide synthases. Chem. Rev.
97:2557-2575.

12. Leadlay, P. F. 1997. Combinatorial approaches to polyketide biosynthesis.
Curr. Opin. Chem. Biol. 1:162-168. 



Vol 181, 1999 



MUTASES AND METHYLMALONYL-CoA 5605 



13. Lydiate, D. J., A. M. Ashby, D. J. Henderson, H. M. Kieser, and D. A.
Hopwood. 1989. Physical and genetic characterization of chromosomal copies
of the Streptomyces coelicolor mini-circle. J. Gen. Microbiol. 135:941-955.

14. MacNeil, D. J. 1988. Characterization of a unique methyl-specific restriction
system in Streptomyces avermitilis. J. Bacteriol. 170:5607-5612.

15. Massey, L. K., J. R. Sokatch, and R. S. Conrad. 1976. Branched-chain amino 
acid catabolism in bacteria. Bacteriol. Rev. 40:42-54. 

16. Muth, G., B. Nussbaumer, W. Wohlleben, and A. Pühler. 1989. A vector
system with temperature sensitive replication for gene disruption and mutational
cloning in Streptomyces. Mol. Gen. Genet. 219:341-348.

17. Norrander, J., T. Kempe, and J. Messing. 1983. Construction of improved
M13 vectors using oligodeoxynucleotide-directed mutagenesis. Gene
26:101-106.

18. Omura, S., K. Tsuzuki, Y. Tanaka, H. Sakakibara, M. Aizawa, and G.
Lukacs. 1983. Valine as a precursor of n-butyrate unit in the biosynthesis of
macrolide aglycone. J. Antibiot. 36:614-616.

19. Ono, M., H. Mishima, Y. Takiguchi, and M. Terao. 1983. Milbemycins, a new
family of macrolide antibiotics. Studies on the biosynthesis of milbemycins
α2, α4 and D using 13C labeled precursors. J. Antibiot. 36:991-1000.

20. Pospíšil, S., P. Sedmera, M. Havránek, V. Krumphanzl, and Z. Vaněk. 1983.
Biosynthesis of monensins A and B. J. Antibiot. 36:617-619.

21. Ratnatilleke, A., J. W. Vrijbloed, and J. A. Robinson. Cloning of the coenzyme
B12-binding domain of isobutyryl-CoA mutase from Streptomyces cinnamonensis,
reconstitution of mutase activity and characterization of the
recombinant enzyme produced in Escherichia coli. Submitted for publication.

22. Reynolds, K. A., D. O'Hagan, D. Gani, and J. A. Robinson. 1988. Butyrate
metabolism in streptomycetes. Characterization of an intramolecular interchange
rearrangement linking isobutyrate and butyrate in Streptomyces cinnamonensis.
J. Chem. Soc. Perkin Trans. 1, p. 3195-3207.

23. Scherf, U., and W. Buckel. 1993. Purification and properties of an iron-sulfur
and FAD-containing 4-hydroxybutyryl-CoA dehydratase/vinylacetyl-CoA
Δ3-Δ2-isomerase from Clostridium aminobutyricum. Eur. J. Biochem. 215:
421-429.

24. Sherman, M. M., S. Yue, and C. R. Hutchinson. 1986. Biosynthesis of
lasalocid A. Metabolic interrelationships of carboxylic acid precursors and
polyether antibiotics. J. Antibiot. 39:1135-1143.



25. Smith, L. M., W. G. Meijer, L. Dijkhuizen, and P. M. Goodwin. 1996. A
protein having similarity with methylmalonyl-CoA mutase is required for the
assimilation of methanol and ethanol by Methylobacterium extorquens AM1.
Microbiology 142:675-684.

26. Söhling, B., and G. Gottschalk. 1996. Molecular analysis of the anaerobic
succinate degradation pathway in Clostridium kluyveri. J. Bacteriol. 178:871-
880.

27. Sood, G. R., D. M. Ashworth, A. A. Ajaz, and J. A. Robinson. 1988. Biosynthesis
of the polyether antibiotic monensin A. Results from the incorporations
of labelled acetate and propionate as probes of the carbon chain
assembly processes. J. Chem. Soc. Perkin Trans. 1, p. 3183-3193.

28. Stark, W. M., N. G. Knox, and J. E. Westhead. 1968. Monensin, a new
biologically active compound. II. Fermentation studies, p. 353-358. Antimicrob.
Agents Chemother. 1967.

29. Tang, L., Y.-X. Zhang, and C. R. Hutchinson. 1994. The genetic basis of
precursor supply for the biosynthesis of macrolide and polyether antibiotics,
p. 105-116. In R. K. Bajpai and A. Prokop (ed.), Recombinant DNA technology
II. The New York Academy of Sciences, New York, N.Y.

30. Valentin, H. E., and D. Dennis. 1996. Metabolic pathway for poly(3-hydroxybutyrate-co-3-hydroxyvalerate)
formation in Nocardia corallina: inactivation
of mutB by chromosomal integration of a kanamycin resistance gene. Appl.
Environ. Microbiol. 62:372-379.

31. Wallace, K. K., Z.-Y. Bao, H. Dai, R. Digate, G. Schuler, M. K. Speedie, and
K. A. Reynolds. 1995. Purification of crotonyl-CoA reductase from Streptomyces
collinus and cloning, sequencing and expression of the corresponding
gene in Escherichia coli. Eur. J. Biochem. 233:954-962.

32. Wolf, D. A., and H. A. Akers. 1986. Uncertainties remain in the catabolism of
valine. Trends Biochem. Sci. 11:390-392.

33. Zerbe-Burkhardt, K., A. Ratnatilleke, N. Philippon, A. Birch, A. Leiser, J. W.
Vrijbloed, D. Hess, P. Hunziker, and J. A. Robinson. 1998. Cloning, sequencing,
expression, and insertional inactivation of the gene for the large subunit
of the coenzyme B12-dependent isobutyryl-CoA mutase from Streptomyces
cinnamonensis. J. Biol. Chem. 273:6508-6517.

34. Zhang, Y. X., L. Tang, and C. R. Hutchinson. 1996. Cloning and characterization
of a gene (msdA) encoding methylmalonic acid semialdehyde dehydrogenase
from Streptomyces coelicolor. J. Bacteriol. 178:490-495.



Proc. Natl. Acad. Sci. USA 

Vol. 95, pp. 7305-7309, June 1998 

Biochemistry 



Ethyl-substituted erythromycin derivatives produced by directed 
metabolic engineering 

D. L. STASSI*†, S. J. KAKAVAS†, K. A. REYNOLDS‡, G. GUNAWARDANA†, S. SWANSON†, D. ZEIDNER†, M. JACKSON†,
H. LIU‡, A. BUKO†, AND L. KATZ†

†Pharmaceutical Products Division, Abbott Laboratories, Abbott Park, IL 60064; and ‡Institute of Structural Biology and Drug Discovery, Department of
Medicinal Chemistry, Virginia Commonwealth University, Richmond, VA 23219

Edited by Arnold L. Demain, Massachusetts Institute of Technology, Cambridge, MA, and approved April 27, 1998 (received for review 
February 26, 1998) 



ABSTRACT A previously unknown chemical structure,
6-desmethyl-6-ethylerythromycin A (6-ethylErA), was produced
through directed genetic manipulation of the erythromycin
(Er)-producing organism Saccharopolyspora erythraea.
In an attempt to replace the methyl side chain at the C-6
position of the Er polyketide backbone with an ethyl moiety,
the methylmalonate-specific acyltransferase (AT) domain of
the Er polyketide synthase was replaced with an ethylmalonate-specific
AT domain from the polyketide synthase involved
in the synthesis of the 16-member macrolide niddamycin.
The genetically altered strain was found to produce ErA,
however, and not the ethyl-substituted derivative. When the
strain was provided with precursors of ethylmalonate, a small
quantity of a macrolide with the mass of 6-ethylErA was
produced in addition to ErA. Because substrate for the
heterologous AT seemed to be limiting, crotonyl-CoA reductase,
a primary metabolic enzyme involved in butyryl-CoA
production in streptomycetes, was expressed in the strain. The
primary macrolide produced by the reengineered strain was
6-ethylErA.



Erythromycin (Er) is a broad-spectrum macrolide antibiotic 
produced by Saccharopolyspora erythraea. The backbone of the 
molecule is a 14-member macrocyclic ring (Fig. 1) that is 
produced through the sequential condensation of one mole- 
cule of propionyl-CoA and six molecules of methylmalonyl- 
CoA by a modular polyketide synthase (PKS). This enzyme 
complex comprises three large multifunctional polypeptides, 
DEBS1, DEBS2, and DEBS3, each of which contains two 
modules, and, in the case of DEBS1, a loading domain to 
initiate synthesis of the polyketide chain (1, 2). Each module 
contains the enzymatic activities necessary for one condensa- 
tion and subsequent reduction of the extender to the growing 
chain. Selection of the appropriate extender unit is accom- 
plished by the acyltransferase (AT) domain present in each 
module (3). 

Recently, genetic engineering has joined synthetic chemistry 
in the production of novel macrolide structures. Hybrid PKSs 
have been constructed through the replacement of AT do- 
mains with those that specify different starter or extender units 
(3-6). These manipulations have resulted in the production of
Er derivatives lacking methyl groups at C-10 and C-12 (5) and 
those incorporating the branched chain starter units of aver- 
mectin biosynthesis (6). To date, however, there have been no 
manipulations that allow the methyl side chains of Er to be 
replaced by ethyl groups. This type of modification would be 
useful in expanding the structural diversity of hybrid 
polyketides produced by combinatorial biosynthesis. 



The publication costs of this article were defrayed in part by page charge
payment. This article must therefore be hereby marked "advertisement" in
accordance with 18 U.S.C. §1734 solely to indicate this fact.

© 1998 by The National Academy of Sciences 0027-8424/98/957305-5$2.00/0
PNAS is available online at http://www.pnas.org.



Niddamycin (Nd) is a 16-member macrolide with an ethyl 
side chain at C-6 (Fig. 1). The module 5 AT, which is believed 
to be responsible for incorporation of the ethyl side chain into 
the polyketide backbone, was identified by sequence analysis 
of the Nd PKS genes (7). In this paper we describe the 
construction of a hybrid PKS through the replacement of a 
methylmalonyl-specific AT of the Er PKS with the
ethylmalonyl-specific AT of the Nd PKS. This substitution
alone, however, was not sufficient to produce an Er derivative 
with an ethyl side chain; additional manipulations of the 
carboxylic acid precursor pools were necessary to generate the 
desired compound. 

MATERIALS AND METHODS 

Strains, Plasmids, and Media. The wild-type Er producer is
S. erythraea ER720 (8). Plasmid pWHM3 (9) is an Escherichia
coli-Streptomyces shuttle vector that is maintained in S. erythraea
only when it contains heterologous DNA for chromosomal
integration. Plasmid pDPE81 is a derivative of pKAS37
(10) in which a 1.7-kb BglII fragment containing the hygromycin
resistance marker was inserted into the BglII site of
pKAS37. E. coli DH5α (Life Technologies, Gaithersburg, MD)
was the host used for plasmid construction and isolation.
Liquid cultures of S. erythraea strains were grown in SGGP (11)
for production of protoplasts and SCM (5) for metabolite or
enzyme analysis. Plate cultures of S. erythraea strains were
grown on R3M medium (5). Thiostrepton (Ts)-resistant strains
of S. erythraea were grown in 25 µg/ml and 10 µg/ml Ts for
plate and liquid cultures, respectively.

Plasmid pEAT4 (Fig. 2A) was constructed as follows. Cosmid
pAIBX85, a pWHM3 derivative containing DNA from
modules 3 and 4 of the Er PKS (corresponding to nucleotides
979-9349; GenBank accession no. M63677), was used to clone
DNA flanking eryAT4. The 5' flanking region was isolated by
digesting pAIBX85 with MscI and BstEII (nucleotides 4247-6033),
treating with the Klenow fragment of DNA polymerase,
and ligating the fragment into the SmaI site of pUC19 to
generate pUC/5' flank. An AvrII site was engineered 13 bp
downstream of the BstEII site by PCR amplification of a
306-bp region of DNA from the PmlI site (nucleotide 5739) to
12 bp 3' of the BstEII site (nucleotide 6045). The engineered
AvrII site does not change the Pro-Arg residues encoded by this
region (Fig. 2B). A BamHI site was also included on the PCR
primer just downstream of the AvrII site. The resulting fragment
was digested with PmlI and BamHI and cloned into the
PmlI/BamHI site of pUC/5' flank, replacing the native sequence
and resulting in vector pUC/5' flank/AvrII. The 3'
flanking DNA was isolated by digesting pAIBX85 with PmlI
and MscI (nucleotides 6999 and 8977), treating with the
Klenow enzyme, and ligating the fragment into the SmaI site
of pUC19, generating vector pUC/3' flank. nidAT5 was
isolated by PCR amplification of cosmid 13f5 (7), which
contains DNA spanning module 5 of the Nd PKS cluster. The
5' oligonucleotide was designed to create an AvrII site 12
nucleotides upstream of the beginning of the nidAT5 domain
(nucleotide 25290; GenBank accession no. AF016585) and to
place the amino acid sequence Pro-Arg-Lys-Pro in front of
nidAT5 to correspond to the sequence that is found upstream
of the eryAT4 domain. The 3' oligonucleotide generated an
FseI site at the end of the nidAT5 domain (nucleotide 26284,
GenBank accession no. AF016585), resulting in a conservative
Val to Ala change. A BamHI site was also incorporated after
the FseI site. The fragment was digested with AvrII and BamHI
and ligated into AvrII/BamHI-digested pUC/5' flank/AvrII,
creating vector pUC/5' flank/nidAT5. The 3' flanking DNA
was then cloned onto the 3' end of nidAT5 by first digesting
pUC/3' flank with FseI and BamHI, gel purifying the 1920-bp
fragment, and ligating it into FseI/BamHI-digested pUC/5'
flank/nidAT5, creating vector pUC/nidAT5/C6-flank. To
generate pEAT4, the nidAT5/flanking DNA cassette was
isolated from pUC/nidAT5/C6-flank by digestion with EcoRI
and HindIII and then ligated to EcoRI/HindIII-digested
pWHM3.

This paper was submitted directly (Track II) to the Proceedings office.
Abbreviations: 6-ethylErA, 6-desmethyl-6-ethylerythromycin A; AT,
acyltransferase; Ccr, crotonyl-CoA reductase; Er, erythromycin; Nd,
niddamycin; PKS, polyketide synthase; Ts, thiostrepton.
*To whom reprint requests should be addressed at: Abbott Laboratories,
CAPD, D-4FR F3-3, 1401 Sheridan Road, North Chicago, IL
60064. e-mail: diane.stassi@abbott.com.

Fig. 1. Structures of Er, Nd, and 6-ethylErA. Shadowed letters represent relevant side groups.
[Structure drawings not reproduced. Panels: Erythromycin A; Niddamycin; 6-Desmethyl-6-ethylerythromycin A. Labeled groups: mycaminose, mycarose, isobutyrate.]

Fig. 2. (A) Diagrammatic representation of insert in vector
pEAT4. DNA encoding the AT from module 5 of the Nd PKS cluster
(nidAT5) was cloned between DNA flanking the 5' and 3' boundaries
of the AT coding region in module 4 of the Er PKS cluster. The
numbered boxes correspond to eryA sequence coordinates from
GenBank accession no. M63676. (B) Amino acid comparisons in the
junction regions of eryAT4, nidAT5, and the eryAT4 flank/nidAT5
construct. The amino acids encoding the AvrII and FseI sites are
indicated.
[Diagram not reproduced. Labels in panel A, left to right: EcoRI; 4247; 6045; nidAT5; 7045; 8977; HindIII.]

                        N-terminus (AvrII site)    C-terminus (FseI site)
eryAT4 wild type        APVTSAPRKP VLVFPGQGAQ      ADLRPAVAGG RPAEIPTYPE
nidAT5 wild type        TGTAGTADRV VFVFPGQGSQ      DPEAVFGPGA RPVELPTYPF
eryAT4 flank/nidAT5     APVTSAPRKP VFVFPGQGSQ      DPEAVFGPGA RPAELPTYPF
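The junction design described above can be checked with a short sketch. It assumes the AvrII recognition sequence (CCTAGG) is read in frame as the two codons CCT AGG, and it uses the junction sequences as transcribed from Fig. 2B; the first residue of the hybrid C-terminal junction is read here as D, which is an assumption about a poorly legible character. The function and variable names are illustrative only.

```python
# AvrII recognition sequence; assumed here to be read in frame as CCT AGG.
AVRII_SITE = "CCTAGG"

# Only the two standard-genetic-code entries needed for this check.
CODON_TABLE = {"CCT": "Pro", "AGG": "Arg"}


def translate(dna):
    """Translate an in-frame DNA string codon by codon."""
    return [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]


def differences(seq_a, seq_b):
    """Return (1-based position, residue in a, residue in b) for each mismatch."""
    return [(i + 1, a, b) for i, (a, b) in enumerate(zip(seq_a, seq_b)) if a != b]


# Junction sequences transcribed from Fig. 2B, spaces removed.
N_TERM = {
    "eryAT4": "APVTSAPRKPVLVFPGQGAQ",
    "nidAT5": "TGTAGTADRVVFVFPGQGSQ",
    "hybrid": "APVTSAPRKPVFVFPGQGSQ",
}
C_TERM = {
    "nidAT5": "DPEAVFGPGARPVELPTYPF",
    # First residue read as D (assumption; the scan is ambiguous here).
    "hybrid": "DPEAVFGPGARPAELPTYPF",
}

# Length of the PCR product spanning nucleotides 5739 to 6045.
PCR_FRAGMENT_BP = 6045 - 5739

if __name__ == "__main__":
    print("AvrII site encodes:", "-".join(translate(AVRII_SITE)))
    print("C-terminal changes:", differences(C_TERM["nidAT5"], C_TERM["hybrid"]))
    print("PCR fragment length (bp):", PCR_FRAGMENT_BP)
```

Under these assumptions the hybrid keeps the eryAT4 residues upstream of the AvrII junction, carries nidAT5 residues downstream of it, and differs from wild-type nidAT5 at the C-terminal junction only by the Val to Ala change noted in the text.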

Plasmid pDPE-ccr (Fig. 3) was constructed as follows. The 
Streptomyces collinus crotonyl-CoA reductase (Ccr) gene (ccr) 
was subcloned from plasmid pZYB3 (12) by digestion with 
XbaI and BamHI, which releases ccr and the upstream T7
ribosomal binding site. This fragment was treated with the 
Klenow enzyme and ligated into the polylinker of pDPE81 that 
had been digested with EcoRI and treated with the Klenow 
enzyme. The polylinker of pDPE81 is in the center of a 10-kb 
fragment of S. erythraea chromosomal DNA, which directs 
integration into the chromosome at a site that does not seem 
to affect Er production. Plasmid pDPE-ccr was designed so 
that the ccr gene expressed from the ermE* promoter (13) and 
the Ts-resistance marker are left behind in the chromosome 
following a double-crossover event. 

Genetic Manipulations. Standard molecular biology tech- 
niques were performed as described (14). Enzymes and re- 
agents were purchased from Life Technologies. Protoplast 
transformation and marker replacement in S. erythraea were 
performed as described (5). One microgram of plasmid DNA 
was routinely used for protoplast transformations. Putative 
transformants were grown in SGGP medium containing 10 
µg/ml Ts to confirm resistance. For chromosomal eviction of
pEAT4, transformants were passaged twice in SGGP medium 
without Ts, cells were plated for spores, and individual colo- 
nies arising from spores were screened for Ts sensitivity. 







Fig. 3. Plasmid pDPE-ccr. The thick black lines represent S. 
erythraea DNA, which allows for integration of the plasmid by homol- 
ogous recombination into the S. erythraea chromosome. Also indicated 
are genes for ampicillin (amp) and Ts resistance. 




Electroporation was performed as described (15) by using 1 µg
of pDPE-ccr DNA. Because the plasmid is unstable when
integrated into the S. erythraea chromosome, two consecutive
platings on Ts-containing R3M plates result in resolution and
eviction of plasmid sequences. PCR and Southern hybridiza- 
tion were as described (7). 

Ccr Assay. Seed cultures of S. erythraea grown in SCM and
S. collinus grown in medium A (16) were diluted 5-fold into
SCM and medium A, respectively, and grown for 48 h at 30°C.
Cells were harvested, opened with a French pressure cell,
centrifuged to obtain cell-free extracts, and assayed spectrophotometrically
for Ccr activity as described (16). One unit of
Ccr activity is defined as 1 µmol of NADPH oxidized in 1 min.
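The unit definition can be illustrated with a minimal sketch of how a spectrophotometric reading might be converted to units. It assumes the assay follows the decrease in absorbance at 340 nm, uses the commonly cited NADPH extinction coefficient of 6.22 mM^-1 cm^-1, and assumes a 1-cm light path; the assay details of reference 16 are not reproduced here, and the function names are hypothetical.

```python
# Commonly cited molar extinction coefficient of NADPH at 340 nm (assumption
# that the assay is read at this wavelength), in mM^-1 cm^-1.
NADPH_EXT_COEFF_MM = 6.22


def ccr_units(delta_a340_per_min, assay_volume_ml, path_cm=1.0):
    """Return units (µmol NADPH oxidized per min) in the assay cuvette.

    delta_a340_per_min: magnitude of the absorbance decrease per minute.
    assay_volume_ml: total assay volume in milliliters.
    """
    # Beer-Lambert law: concentration change in mM per minute.
    mm_per_min = delta_a340_per_min / (NADPH_EXT_COEFF_MM * path_cm)
    # 1 mM equals 1 µmol per ml, so multiplying by volume gives µmol per min.
    return mm_per_min * assay_volume_ml


def specific_activity(units, protein_mg):
    """Return units per milligram of protein in the assay."""
    return units / protein_mg


if __name__ == "__main__":
    # Hypothetical reading: 0.622 absorbance units per minute in a 1-ml assay.
    print(ccr_units(0.622, 1.0))
```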

Isolation and Identification of Metabolites. Small-scale
(milliliters) isolation of metabolites from S. erythraea, TLC,
and MS analysis were performed as described (5). Large-scale
isolation for compound identification was as follows. Fermentations
were conducted in 42-liter LH fermentation series 2000
stainless steel vessels (LH Fermentation, Maidenhead, UK).
Each fermentor was charged with 30 liters of a medium
consisting of 1.5% soluble starch, 2.2% soybean flour, 0.2%
CaCO3, 0.15% brewer's yeast (Wind Gap Farms, Baconton,
GA), 0.102% MgSO4·7H2O, 0.0027% FeSO4·7H2O, and 1%
soybean oil. Sterilization was at 121°C and 15 psi (1 psi = 6.89
kPa). A two-step seed protocol was used. Vegetative growth
from a frozen stock of S. erythraea strain EAT4-ccr was
inoculated at 1.5% into a 500-ml Erlenmeyer flask containing
100 ml of the following medium: 1.5% glucose monohydrate/1%
soluble starch/1% Soytone (Difco)/0.9% yeast extract
(Difco)/0.21% Mops/0.001% Ts. For the second step, a
2-liter Erlenmeyer flask containing 600 ml of the same medium
was inoculated at 5% with the first-step growth. Both
steps were incubated at 28°C on a rotary shaker at 225 rpm for
48 h. The fermentor was inoculated at 5% with the second-step
seed growth. Fermentor temperature was controlled at 32°C,
agitation was 250 rpm, aeration was 1 vol·vol^-1·min^-1, and
head pressure was 6 psi. Silicone antifoam was added at 0.01%
initially and was available on demand. Harvest was at 108 h.
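The quantities in this protocol can be worked through with a small sketch. It assumes that the percentages are weight per volume (1% = 10 g per liter), which the text does not state explicitly, and it uses the conversion factor given in the text (1 psi = 6.89 kPa). The names below are illustrative.

```python
# Fermentor medium composition from the text, in percent.
# Assumption: percentages are weight/volume, i.e. 1% = 10 g per liter.
MEDIUM_PERCENT = {
    "soluble starch": 1.5,
    "soybean flour": 2.2,
    "CaCO3": 0.2,
    "brewer's yeast": 0.15,
    "MgSO4.7H2O": 0.102,
    "FeSO4.7H2O": 0.0027,
    "soybean oil": 1.0,
}

KPA_PER_PSI = 6.89  # conversion factor given in the text


def grams_for_volume(percent_wv, liters):
    """Grams of a component needed for the given volume at percent w/v."""
    return percent_wv * 10.0 * liters


def psi_to_kpa(psi):
    """Convert pressure from psi to kPa using the factor in the text."""
    return psi * KPA_PER_PSI


def inoculum_liters(percent, culture_liters):
    """Volume of inoculum for a given percent (v/v) of culture volume."""
    return percent / 100.0 * culture_liters


if __name__ == "__main__":
    for name, pct in MEDIUM_PERCENT.items():
        print(f"{name}: {grams_for_volume(pct, 30):.2f} g per 30 liters")
    print(f"Sterilization pressure: {psi_to_kpa(15):.2f} kPa")
    print(f"Fermentor inoculum at 5%: {inoculum_liters(5, 30):.2f} liters")
```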

The fermentation broth (28 liters) was filtered, the pH was
adjusted to 9 with NH4OH, and the broth was extracted with
CH2Cl2 (twice with 14 liters each). The combined extracts were
concentrated, and the residue was partitioned between the two
phases of the heptane/methanol/0.02 M K2HPO4 system (pH
6; 1:1:1). The aqueous phase was evaporated, and the residue
was dissolved in 0.05 M aqueous potassium phosphate buffer
(100 ml), adjusted to pH 9 with NH4OH, and extracted twice
with equal volumes of CH2Cl2. The CH2Cl2-soluble fraction
(1.65 g) was separated by droplet countercurrent chromatography
(100 vertical columns; 0.4 × 24 cm) by using the upper
layer from the hexane/ethyl acetate/0.02 M K2HPO4 (pH 8;
1:1:1) system as the mobile phase. The fractions were analyzed
by bioassay against Staphylococcus aureus and by 1H NMR. The
bioactive fractions that were shown to contain macrolides by
NMR were pooled, dried (0.4 g), and chromatographed on a
Sanki Engineering (Kyoto) HPLC centrifugal partition chromatograph
by using the hexane/ethyl acetate/0.02 M K2HPO4
(pH 6; 1:1:1) system. The active fraction was further purified
by HPLC on C18-derivatized silica by using an acetonitrile/
methanol/0.01 M (CH3)4NOH/0.05 M KH2PO4 (pH 6;
73:10:59:59) system. Two-dimensional NMR experiments, including
double-quantum correlation, heteronuclear multiple
quantum correlation, and heteronuclear multiple-bond correlation,
performed in C²HCl3, were used for structure elucidation
and the complete assignment of the 1H and 13C NMR
signals.

In Vitro Antibacterial Activity. Antibacterial activity was
determined by the broth microdilution method (National
Committee for Clinical Laboratory Standards (Villanova, PA)
M7-A4, 1997), except that Brain Heart Infusion (Difco) broth
was used as the test medium. Assays were incubated overnight
at 35°C. Minimal inhibitory concentrations were defined as the
lowest drug concentration (µg/ml) inhibiting bacterial growth.
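The definition of the minimal inhibitory concentration can be expressed as a short sketch. The dilution series and growth readings below are hypothetical and are not data from this study; the function name is likewise illustrative.

```python
def minimal_inhibitory_concentration(growth_by_concentration):
    """Return the lowest concentration (µg/ml) at which no growth was seen.

    growth_by_concentration maps each tested concentration to True when
    visible growth was observed and False when growth was inhibited.
    Returns None when growth occurred at every tested concentration.
    """
    inhibitory = [conc for conc, grew in growth_by_concentration.items() if not grew]
    return min(inhibitory) if inhibitory else None


if __name__ == "__main__":
    # Hypothetical twofold dilution series (µg/ml) with growth readings.
    readings = {0.25: True, 0.5: True, 1.0: False, 2.0: False, 4.0: False}
    print(minimal_inhibitory_concentration(readings))
```

This simple form assumes that growth is inhibited at every concentration above the reported value, as is expected for a well-behaved dilution series.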

RESULTS 

Construction of S. erythraea EAT4. Plasmid pEAT4 was 
constructed to replace DNA encoding the AT domain of 
module 4 in the Er PKS (eryAT4) with DNA encoding the AT 
domain from module 5 of the Nd PKS (nidAT5) (7). S. 
erythraea ER720 protoplasts were transformed with pEAT4 
DNA, and 10 transformants were obtained. Genomic DNA 
was extracted from one of the transformants for Southern 
analysis, in which probing with pWHM3 DNA confirmed the 
integration of pEAT4 at the appropriate location in the 
chromosome (data not shown). Nonselective growth to allow 
plasmid eviction through a double-crossover event yielded 96 
colonies, of which 9 were found to be Ts sensitive. Southern 
analysis of the 9 clones showed that 3 had nidAT5 DNA 
sequences in place of eryAT4 chromosomal sequences, and the 
remaining 6 had segregated to wild type (data not shown). 

Characterization of S. erythraea EAT4. To analyze the Er
derivatives produced by S. erythraea EAT4, the three isolates
and four of the wild-type segregants were grown in SCM
medium for 4 d. The four wild-type segregants produced spots
indistinguishable in color and Rf from the ErA standard. The
three strains in which nidAT5 replaced the AT of module 4 of
the Er PKS produced spots similar in Rf and color to the
wild-type segregants, but the spots were much less intense
(data not shown).

To determine the mass of the compound produced by S. 
erythraea EAT4, the supernatant of a 50-ml SCM culture of 
one of the isolates was extracted with ethyl acetate. The extract 
was subjected to TLC, but only the edges of the plate were 
sprayed with anisaldehyde to locate the region of interest. A 
1-cm band of resin was scraped from the unsprayed portion of 
the plate at the Rf of ErA. The resin was extracted twice with
500 µl of an ethyl acetate/methanol (2:1) solution, and the
organic phase was dried and then analyzed by electrospray 
ionization MS. Surprisingly, the mass of the protonated mo- 
lecular ion of the compound was observed at m/z 734, which 
corresponds to the mass of ErA, not an ethyl-substituted ErA 
derivative. 
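The mass argument can be made concrete with a brief calculation. It assumes the molecular formula C37H67NO13 for ErA and takes an ethyl-for-methyl substitution as one added CH2 unit (C38H69NO13); the monoisotopic atomic masses are standard values, and the names below are illustrative.

```python
# Monoisotopic atomic masses (Da) of the elements involved.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
PROTON_MASS = 1.00727646  # mass of the proton added on protonation

# Assumed molecular formulas: ErA, and ErA with one methyl replaced by ethyl.
ERA = {"C": 37, "H": 67, "N": 1, "O": 13}
ETHYL_ERA = {"C": 38, "H": 69, "N": 1, "O": 13}


def protonated_mz(formula):
    """Return m/z of the singly protonated molecular ion for a formula dict."""
    neutral = sum(MONOISOTOPIC[element] * count for element, count in formula.items())
    return neutral + PROTON_MASS


if __name__ == "__main__":
    print(f"ErA [M+H]+: {protonated_mz(ERA):.3f}")
    print(f"Ethyl-substituted ErA [M+H]+: {protonated_mz(ETHYL_ERA):.3f}")
```

With these formulas the protonated ions differ by the mass of one methylene group, about 14 Da, so an ethyl-substituted ErA derivative would be expected 14 units above the observed m/z 734.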

Feeding of Ethylmalonyl-CoA Precursors to 5. erythraea 
EAT4. One hypothesis for the failure of 5. erythraea EAT4 to 
produce an ethyl-substituted derivative is that ethylmalonyl- 
CoA, the substrate for the nidAT5, is lacking. To test this 
hypothesis, cells were grown for 4 d in SCM medium contain- 
ing precursor compounds, and ethyl acetate extracts of the 
cultures were analyzed. When either 50 mM butanol or 50 mM 
butyrate was added to the medium, a second spot running 
slightly faster than the Rf of ErA was seen. Butyrate-fed 
cultures (Fig. 4A) produced about a 1:1 ratio of the two spots, 
whereas the butanol-fed culture produced more of the ErA- 
like spot (data not shown). Addition of 10 mM ethylmalonate 
failed to produce significant amounts of either of the two spots 
described above. However, addition of 10 mM diethylethyl- 
malonate was found to yield much more of the faster migrating 
compound than that seen with the other precursor compounds 
tested and very little compound migrating at the Rf of ErA 
(Fig. 4A). Cells grown in unsupplemented SCM medium 
produced only material migrating like ErA. 

To characterize the newly synthesized compound, extracts of 
the butyrate- and the diethylethylmalonate-fed cultures were 
subjected to TLC, the region of interest was scraped from the 
plate and reextracted for electrospray ionization MS. The 
results showed that the butyrate-fed culture of S. erythraea 
EAT4 produced approximately equal amounts of compounds 
with protonated molecular ions at m/z 734 and 748. The 748 
species is consistent with an additional methylene group on 
ErA, e.g., with an ethyl group replacing a methyl group on the 



7308 Biochemistry: Stassi et al 

Fig. 4. TLC analysis of S. erythraea EAT4 and S. erythraea EAT4-ccr. 
(A) Ethyl acetate extracts of S. erythraea EAT4 cultures grown 
under various conditions. Lanes: 1, SCM medium; 2, SCM + 50 mM 
butyrate; 3, SCM + 10 mM diethylethylmalonate; 4, ErA standard (5 
µg). The arrows indicate the position of ErA. (B) Ethyl acetate extracts 
of S. erythraea strains EAT4 (lane 2) and EAT4-ccr (lane 3) grown in 
SCM medium, along with 5.0 µg of ErA standard (lane 1). The arrow 
indicates the position of ErA. 

macrolide ring. The diethylethylmalonate sample contained 
primarily a compound with a mass of 748, with only a trace of 
ErA (734) present. 

Genetic Manipulation of Ethylmalonyl-CoA Levels in S. 
erythraea EAT4. Successful production of an Er derivative with 
a mass consistent with the addition of an ethyl side chain by 
butyrate- and diethylethylmalonate-fed S. erythraea EAT4 
suggested that the levels of ethylmalonyl-CoA, the likely substrate 
of the nidAT5, must be limiting. In streptomycetes, Ccr 
catalyzes the last step in the reductive biosynthesis of butyryl-CoA 
from two molecules of acetyl-CoA (12). Because butyryl-CoA 
can then be carboxylated to form ethylmalonyl-CoA, the 
possibility that Ccr could be used to increase the levels of the 
ethyl-substituted Er derivative was investigated. The ccr gene 
of S. collinus was expressed from the strong ermE* promoter 
at a site unlinked to the Er biosynthetic cluster in the S. 
erythraea EAT4 chromosome. Plasmid pDPE-ccr was electroporated 
into S. erythraea EAT4. About 40 Ts-resistant colonies 
were obtained, of which 7 were confirmed to be stable 
transformants. Genomic DNA was isolated from 2 of the 
transformants for Southern analysis. Probing with the S. 
collinus ccr gene confirmed that both isolates carried the gene 
at the expected location in the chromosome (data not shown). 
The strain was named S. erythraea EAT4-ccr. 

To assess expression levels of the ccr gene, Ccr activity of S. 
erythraea EAT4-ccr was compared with that of S. erythraea 
EAT4 (parental strain) and wild-type S. collinus. We found 
(Table 1) that, although there was no detectable activity in the 
S. erythraea strain without the ccr gene, S. erythraea EAT4-ccr 

Table 1. Ccr activity in S. collinus and engineered 
S. erythraea strains 

                         Enzyme          Protein          Specific 
                         activity,       concentration,   activity, 
Sample                   milliunits/ml   mg/ml            milliunits/mg 

S. erythraea EAT4        <0.5 ± 0.0      3.0              <0.2 ± 0.0 
S. erythraea EAT4-ccr    46.9 ± 2.0      3.0              15.6 ± 0.7 
S. collinus wild type    5.7 ± 1.0       7.0              0.8 ± 0.1 


Proc. Natl. Acad. Sci. USA 95 (1998) 

had about 20 times the relative activity of the wild-type S. 
collinus strain from which the gene originated. 
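
The specific activities in Table 1 follow from dividing enzyme activity by protein concentration, and the "about 20 times" comparison is the ratio of the two reported specific activities. A minimal Python sketch of that arithmetic is given here. It uses only values reported in Table 1, and the variable names are illustrative rather than taken from the original work.

```python
# Values for S. erythraea EAT4-ccr as reported in Table 1.
enzyme_activity = 46.9   # milliunits/ml
protein_conc = 3.0       # mg/ml

# Specific activity = enzyme activity / protein concentration.
specific_activity = enzyme_activity / protein_conc  # milliunits/mg

# Reported specific activities (milliunits/mg) used for the comparison.
reported_eat4_ccr = 15.6
reported_collinus = 0.8
fold_over_collinus = reported_eat4_ccr / reported_collinus

print(f"EAT4-ccr specific activity: {specific_activity:.1f} milliunits/mg")
print(f"EAT4-ccr relative to S. collinus: {fold_over_collinus:.1f}-fold")
```

The computed specific activity agrees with the tabulated 15.6 milliunits/mg, and the ratio of about 19.5 is consistent with the "about 20 times" stated in the text.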

To determine the effect of the ccr gene on production of an 
ethyl-substituted Er derivative, ethyl acetate extracts of super- 
natants of 4-d SCM-grown cultures of the EAT4 strains with 
and without the ccr gene were examined (Fig. 4B). TLC 
analysis of 10 ml of extracted cells demonstrated that EAT4 
without the ccr gene produced a spot that migrated with ErA, 
whereas EAT4-ccr produced a spot that migrated faster than 
the ErA spot, with no ErA-like material visible. 

Extracts were also analyzed by electrospray ionization MS. 
The EAT4 strain produced a compound with a protonated 
molecular ion at m/z 734, indicating ErA production. The 
EAT4-ccr strain produced a compound with a mass of 748, 
which is consistent with the production of the 6-ethylErA. No 
compound with a mass of 734 (ErA) was detected in extracts 
of this strain. 
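
The m/z values of 734 and 748 can be checked against the elemental compositions. The sketch below computes the monoisotopic m/z of the singly protonated ion for ErA and for a derivative carrying one extra CH2. It assumes the standard formula of erythromycin A (C37H67NO13) and standard isotope masses; neither is stated in this section, and the function name is hypothetical.

```python
# Monoisotopic masses (u) of the most abundant isotopes, and the electron mass.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
ELECTRON = 0.00054858


def protonated_mz(formula):
    """Return m/z of the singly charged [M + H]+ ion.

    `formula` maps element symbols to atom counts for the neutral molecule.
    """
    neutral = sum(MONOISOTOPIC[element] * count for element, count in formula.items())
    return neutral + MONOISOTOPIC["H"] - ELECTRON


# Assumed neutral formulas: erythromycin A, and ErA plus one methylene (CH2).
ERA = {"C": 37, "H": 67, "N": 1, "O": 13}
ETHYL_ERA = {"C": 38, "H": 69, "N": 1, "O": 13}

mz_era = protonated_mz(ERA)
mz_ethyl = protonated_mz(ETHYL_ERA)

print(f"ErA [M + H]+:        {mz_era:.4f}")
print(f"6-ethylErA [M + H]+: {mz_ethyl:.4f}")
print(f"difference (CH2):    {mz_ethyl - mz_era:.4f}")
```

The two values round to the nominal m/z of 734 and 748, and the difference of about 14.016 u corresponds to one methylene group.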

Structural Analysis and Biological Activity of Compound 
Produced by S. erythraea EAT4-ccr. The structure of 6-ethylErA 
was confirmed by spectroscopic methods. The high 
resolution fast atom bombardment MS analysis of the sample 
gave an [M + H]+ ion at m/z 748.4846 [calculated for 
C38H70NO13 m/z = 748.4842 (Δ 0.5 ppm)], indicating that this 
molecule has an additional methylene function compared with 
ErA. The presence of an ethyl moiety in the molecule at the 
C-6 position and its relative stereochemistry were confirmed 

Table 2. NMR data of 6-ethylErA 

Carbon no.          13C shift (δ), ppm    1H shift (δ), ppm 

1                   175.5 
2                   44.8                  2.93 
3                   80.7                  3.79 
4                   37.6                  1.93 
5                   81.6                  3.86 
6                   76.7 
7                   38.4                  1.72, 1.93 
8                   44.7                  2.70 
9                   222.4 
10                  37.7                  3.01 
11                  68.9                  3.82 
12                  74.4 
13                  76.7                  5.06 
14                  21.2                  1.47, 1.91 
15                  10.7                  0.84 
2-CH3               15.9                  1.29 
4-CH3               9.7                   1.14 
6-CH2-CH3 (CH2)     27.6                  1.70, 2.02 
6-CH2-CH3 (CH3)     7.7                   0.98 
8-CH3               18.6                  1.17 
10-CH3              12.3                  3.01 
12-CH3              68.9                  3.82 
1'                  102.5                 4.48 
2'                  72.5                  3.21 
3'                  65.3                  2.44 
4'                  28.3                  1.22, 1.63 
5'                  68.7                  3.52 
6'                  21.6                  1.22 
3'-N(CH3)2          40.7                  2.29 
1''                 96.2                  4.92 
2''                 35.3                  2.34, 1.66 
3''                 72.4 
4''                 77.7                  3.02 
5''                 66.3                  3.98 
6''                 21.6                  1.29 
3''-OCH3            49.7                  3.33 
3''-CH3             21.4                  1.26 


Numbering of carbon atoms of Er is as described (17). 






by two-dimensional NMR. The stereospecificity of the ethyl 
moiety was found to be the same as that of the methyl group 
present at C-6 of ErA (Fig. 1). The 1H and 13C NMR 
assignments are shown in Table 2. 
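
The 0.5 ppm figure quoted for the high resolution measurement is the relative difference between the observed and calculated m/z. A short Python sketch of that calculation, using the two values given in the text (the function name is illustrative):

```python
def ppm_error(observed_mz, calculated_mz):
    """Relative mass error in parts per million."""
    return (observed_mz - calculated_mz) / calculated_mz * 1e6


observed = 748.4846     # [M + H]+ measured by high resolution FAB MS
calculated = 748.4842   # calculated for C38H70NO13

error = ppm_error(observed, calculated)
print(f"mass error: {error:.1f} ppm")
```

An error well below 1 ppm supports the assigned elemental composition, which differs from that of ErA by one CH2.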

Antibacterial activity of 6-ethylErA was compared with that 
of ErA against a panel of S. aureus, Staphylococcus epidermidis, 
Streptococcus pyogenes, and Enterococcus faecium strains from 
the Abbott culture collection. Although 6-ethylErA does re- 
tain biological activity, it was found to be 15 to 60 times less 
potent than ErA against these strains. Representative minimal 
inhibitory concentrations (µg/ml) for 6-ethylErA and ErA, 
respectively, were as follows: S. aureus (4 and 0.06), S. epider- 
midis (4 and 0.12), E. faecium (4 and 0.12), and S. pyogenes (1 
and 0.06). 
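
The loss of potency can be read directly from the ratio of the two minimal inhibitory concentrations for each organism. The sketch below computes those ratios from the representative values quoted above; the dictionary layout and names are illustrative only.

```python
# Representative MICs in µg/ml as (6-ethylErA, ErA), taken from the text.
mics = {
    "S. aureus": (4, 0.06),
    "S. epidermidis": (4, 0.12),
    "E. faecium": (4, 0.12),
    "S. pyogenes": (1, 0.06),
}

# Fold loss of potency = MIC of 6-ethylErA / MIC of ErA.
fold_loss = {organism: ethyl / era for organism, (ethyl, era) in mics.items()}

for organism, fold in fold_loss.items():
    print(f"{organism}: about {fold:.0f}-fold less potent")
```

With the values as printed, the ratios run from about 17 to about 67. If 0.06 and 0.12 are read as rounded steps of a twofold dilution series (0.0625 and 0.125, an assumption not stated in the text), the ratios become 16 to 64, in line with the stated range of 15 to 60 times.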

DISCUSSION 

In this work we have shown that an ethylmalonate AT can be 
substituted for a methylmalonate AT to produce a hybrid PKS 
that synthesizes the predicted macrolide product, 6-ethylErA. 
This suggests that Er analogs containing ethyl substitutions at 
positions other than C-6 or that ethyl analogs of other mac- 
rolides, such as rifamycin or rapamycin, can be prepared in a 
similar fashion by using the ethylmalonate-specific AT from 
the Nd PKS. 

One key feature for the successful generation of 6-ethylErA 
was availability of ethylmalonyl-CoA as substrate for the 
ethylmalonate-specific AT of the Nd PKS in S. erythraea. This 
AT clearly demonstrates a relaxed specificity by reacting with 
both ethylmalonyl-CoA and methylmalonyl-CoA. In S. eryth- 
raea EAT4, the desired product could be obtained only by 
boosting the levels of ethylmalonyl-CoA by either precursor 
feeding or expression of Ccr, an enzyme involved in butyryl- 
CoA biosynthesis. These results suggest that ethylmalonyl- 
CoA derived from butyryl-CoA is not available at sufficient 
levels to compete with methylmalonyl-CoA for the ethylma- 
lonate-specific AT. Ccr activity is not detectable in S. erythraea 
and attempts to detect a ccr gene in this organism by hybrid- 
ization with the S. collinus ccr gene have failed (results not 
shown). In contrast, the ccr gene appears to be present in most 
streptomycetes (16), and in certain cases it appears to be 
clustered with antibiotic biosynthetic gene clusters. For exam- 
ple, a ccr homolog has been located in the biosynthetic cluster 
for tylosin (18), another 16-member macrolide with an ethyl 
side chain. Southern analysis of cosmid clones containing Nd 
biosynthetic genes also indicates that a ccr homolog may reside 
close to the Nd PKS (unpublished results). 

It was demonstrated previously (5) that production of 
desmethyl Er derivatives by replacement of methylmalonate- 
specific ATs of the Er PKS with malonate-specific ATs from 
Streptomyces hygroscopicus and Streptomyces venezuelae was 
successful only in Er modules 1 and 2 (corresponding to C-12 
and C-10 of Er, respectively) and did not give rise to a 
detectable polyketide when placed in module 4. In contrast, we 
successfully produced ethyl-substituted Er by replacement of 
the methylmalonate-specific AT of Er module 4 with the 
ethylmalonate-specific AT from the Nd cluster. In fact, in S. 
erythraea expressing the S. collinus ccr gene, replacements in 
modules 1 through 4 with the ethylmalonate-specific AT 
produced compounds with electrospray ionization mass spec- 
tra consistent with production of ethyl substitutions for methyl 
groups at C-12, C-10, C-8, and C-6, respectively. However, 
those in modules 5 and 6 did not produce compounds of mass 
748 (data not shown). It is not known whether failure to 
produce the predicted compounds was caused by physical 
distortion of the PKS or its mRNA, inability of the PKS to 
process certain altered growing chains, or some structural 



instability of the macrolide itself because of the introduced 
change. Thus, even though genetic information may be present 
for the production of novel compounds, it is still not possible 
to predict which substitutions will yield detectable levels of 
product. 
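
The correspondence between Er PKS modules and ring positions used above (modules 1 through 4 giving substitutions at C-12, C-10, C-8, and C-6) follows a simple pattern that can be written down as a lookup. The sketch below is illustrative: the function name is hypothetical, and extending the pattern to modules 5 and 6 (C-4 and C-2) is an extrapolation from the four positions named in the text.

```python
def substituted_carbon(module):
    """Ring carbon bearing the side chain set by the AT of a given Er PKS module.

    Modules 1-4 reproduce the positions named in the text; modules 5 and 6
    are extrapolated from the same two-carbons-per-module pattern.
    """
    if not 1 <= module <= 6:
        raise ValueError("the Er PKS has six extension modules")
    return 14 - 2 * module


for module in range(1, 7):
    print(f"module {module} -> C-{substituted_carbon(module)}")
```

Each extension module adds a two-carbon unit to the growing chain, so the substituted position moves two carbons toward C-1 with every successive module.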

It has also been demonstrated that the malonyl-specific ATs 
can be distinguished from methylmalonyl-specific ATs through 
sequence alignments (3). By using this strategy, Ruan et al. (5) 
predicted that an AT from an unidentified PKS of S. hygro- 
scopicus was a malonyl-specific AT, and they used it success- 
fully to produce desmethyl Er derivatives. The Nd ethyl- 
specific AT was found to cluster with methylmalonyl-specific 
ATs (7) in similar AT alignments. This similarity may explain 
why methylmalonate was used as a substrate by nidAT5 in S. 
erythraea EAT4 to produce ErA when ethylmalonyl-CoA was 
not available. 

Finally, structural determination by NMR not only con- 
firmed the presence of the ethyl side chain on the Er derivative 
produced by S. erythraea EAT4 but also showed that the 
absolute configuration at this chiral center is the same as that 
in ErA. In Er biosynthesis, epimerization is required at C-6 
because the PKS uses (2S)-methylmalonyl-CoA for chain 
extension (19). It is unclear whether the cognate epimerization 
occurs when ethylmalonate is used in the synthesis of 6-eth- 
ylErA because the stereochemistry of the ethylmalonate in- 
corporated at C-6 is not known. 

We thank Angela Nilius for performing minimal inhibitory 
concentration assays and Jim McAlpine, Rich Summers, and Thomas 
Vanden Boom for helpful discussions. This work was supported in part 
by Grant GM50542 (to K.A.R.) from the National Institutes of Health. 

1. Cortes, J., Haydock, S. F., Roberts, G. A., Bevitt, D. J. & Leadlay, 
P. F. (1990) Nature (London) 348, 176-178. 

2. Donadio, S. & Katz, L. (1992) Gene 111, 51-60. 

3. Oliynyk, M., Brown, M. J. B., Cortes, J., Staunton, J. & Leadlay, 
P. F. (1996) Chem. Biol. 3, 833-839. 

4. Kuhstoss, S., Huber, M., Turner, J. R., Paschal, J. W. & Rao, 
R. N. (1996) Gene 183, 231-236. 

5. Ruan, X., Pereda, A., Stassi, D., Zeidner, D., Summers, R., 
Jackson, M., Shivakumar, A., Kakavas, S. & Katz, L. (1997) J. 
Bacteriol. 179, 6416-6425. 

6. Marsden, A. F. A., Wilkinson, B., Cortes, J., Dunster, N. J., 
Staunton, J. & Leadlay, P. F. (1998) Science 279, 199-202. 

7. Kakavas, S. J., Katz, L. & Stassi, D. (1997) J. Bacteriol. 179, 
7515-7522. 

8. DeWitt, J. P. (1985) J. Bacteriol. 164, 969-971. 

9. Vara, J. A., Lewandowska-Skarbek, M., Wang, Y.-G., Donadio, S. 
& Hutchinson, C. R. (1989) J. Bacteriol. 171, 5872-5881. 

10. Maine, G., Post, D., Satter, M. & Stassi, D. (1997) International 
patent application WO 97/06266 (2/20/97). 

11. Yamamoto, H., Maurer, K. H. & Hutchinson, C. R. (1986) J. 
Antibiot. 39, 1304-1313. 

12. Wallace, K. K., Han, L., Lobo, S. & Reynolds, K. K. (1997) J. 
Bacteriol. 179, 3884-3891. 

13. Bibb, M. J., White, J., Ward, J. M. & Janssen, G. R. (1994) Mol. 
Microbiol. 14, 533-545. 

14. Sambrook, J., Fritsch, E. F. & Maniatis, T. (1989) Molecular 
Cloning: A Laboratory Manual (Cold Spring Harbor Lab. Press, 
Plainview, NY), 2nd Ed. 

15. Fitzgerald, N. B., English, R. S., Lampel, J. S. & Vanden Boom, 
T. (1998) Appl. Environ. Microbiol. 64, 1580-1583. 

16. Wallace, K. K., Bao, Z.-Y., Dai, H., Digate, R., Schuler, G., 
Speedie, M. & Reynolds, K. (1995) Eur. J. Biochem. 233, 954-962. 

17. Lartey, P. A. & Faghih, R. (1993) in Recent Progress in the 
Chemical Synthesis of Antibiotics, eds. Lukacs, G. & Ohno, M. 
(Springer, New York), p. 121. 

18. Gandecha, A. R., Large, S. L. & Cundliffe, E. (1997) Gene 184, 
197-203. 

19. Weissman, K. J., Timoney, M., Bycroft, M., Grice, P., Hanefeld, 
U., Staunton, J. & Leadlay, P. F. (1997) Biochemistry 36, 13849- 
13855. 



